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VASCULAR ENDOTHELIAL GROWTH FACTOR-X 

The present invention is concerned with a novel 
vascular endothelial growth factor (VEGF) herein 
5 designated ^VEGF-X", and characterisation of the 
nucleic acid and amino acid sequences of VEGF-X. 

Introduction 

10 Angiogenesis involves formation and proliferation of 
new blood vessels, and is an essential physiological 
process for normal growth and development of tissues 
in, for example, embryonic development, tissue 
regeneration and organ and tissue repair. 

15 Angiogenesis also features in the growth of human 

cancers which require continuous stimulation of blood 
vessel growth. Abnormal angiogenesis is associated 
with other diseases such as rheumatoid arthritis 
psoriasis and diabetic retinopathy. 

20 

Capillary vessels consist of endothelial cells which 
carry the genetic information necessary to proliferate 
to form capillary networks. Angiogenic molecules 
which can initiate this process have previously been 

25 characterised. A highly selective mitogen for 

vascular enothelial cells is vascular endothelial 
growth factor (VEGF) (Ferrara et al., "'Vascular 
Endothelial Growth Factor: Basic Biology and Clinical 
Implications''. Regulation of angiogenesis, by I.D. 

30 Goldberg and E.M. Rosen 1997 Birkhauser Verlag 

Basle/Switzerland) . VEGF is a potent vasoactive 
protein which is comprised of a glycosylated cationic 
4 6-4 9 kd dimer having two 24 kd subunits. It is 
inactivated by sulfhydryl reducing agents and is 

35 resistant to acidic pH and to heating and binds to 
immobilised heparin. 
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VEGF-A has four different forms of 121, .165, 189 and 
206 amino acids respectively due to alternative 
splicing. VEGF121 and VEGF165 are soluble and are 
capable of promoting angiogenesis, whereas VEGF189 and 
5 VEGF206 are bound to heparin containing proteoglycans 
in the cell surface. The temporal and spatial 
expression of VEGF has been correlated with 
physiological proliferation of the blood vessels 
(Gajdusek, CM. , and Carbon, S.J./ Ceil Physiol., 

10 139:570-579, (1989)); McNeil, P.L., Muthukrishnan, L., 
Warder/ E., D'Amore, P. A., J. Cell. Biol., 109:811- 
822/ (1989) ) . Its high affinity binding sites are 
localized only on endothelial cells in tissue sections 
(Jakeman, L.B., et al., Clin. Invest. 89:244-253 

15 (1989)). The growth factor can be isolated from 

pituitary cells and several tumor cell lines, and has 
been implicated in some human gliomas (Plate, K.H. 
Nature 359:845-348, (1992)). The inhibition of VEGF 
function by anti-VEGF monoclonal antibodies was shown 

20 to inhibit tumor growth in immune-deficient mice (Kirn, 
K.J./ Nature 362:841-844, (1993)). 

VEGF proteins have been described in the following 
patents and applications all of which are hereby 

25 incorporated by reference EP-0, 506, 477, WO-95/24473, 
WO-98/28621, WO-90/13649, EP-0, 476, 983, EP-0, 550, 296, 
WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300, 
WO-98/36075, WO-98/840124 , WO-90/11084, WO-98/24811, 
WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250, 

30 WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551. 

The present inventors have now identified a further 
vascular endothelial growth factor, designated herein 
as U VEGF-X", and the nucleic acid sequence encoding 
35 it, which has potentially significant benefits for the 
treatment of tumours and other conditions mediated by 
inappropriate angiogenic activity. 
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Summary of the Invention 

In the present application, there is provided a novel 
vascular endothelial growth factor, herein designated 
5 "VEGF-X", nucleic acid molecules encoding said growth 
factor, an expression vector comprising said nucleic 
acid molecule, a host cell transformed with said 
vector and compounds which inhibit or enhance 
angiogenesis . Also provided is the sequence of a CUB 
10 domain present in the sequence of VEGF-X which domain 
itself prevents angiogenesis and which is used to 
treat diseases associated with inappropriate 
vascularisation or angiogenesis. 

15 Detailed Description of the Invention 

Therefore, according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
encoding a VEGF-X protein or a functional equivalent, 

20 fragment, derivative or bioprecursor thereof, said 
protein comprising the amino acid sequence from 
position 23 to 345 of the amino acid sequence 
illustrated in Figure 10. Alternatively, the nucleic 
acid molecule of the invention encodes the complete 

25 sequence identified in Figure 10 and which 

advantageously includes a signal peptide to express 
said protein extracellularly. Preferably, the nucleic 
acid molecule is a DNA and even more preferably a cDNA 
molecule. Preferably, the nucleic acid molecule 

30 comprises the nucleotide sequence from position 257 to 
1291 of the nucleotide sequence illustrated in Figure 
9. In a preferred embodiment the nucleic acid is of 
mammalian origin and even more preferably of human 
origin . 

35 

In accordance with the present invention a functional 
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equivalent should be taken to mean a protein, or a 
sequence of amino acids that have similar function to 
the VEGF-X protein of the invention. 

5 Also provided by this aspect of the present invention 
is a nucleic acid molecule such as an antisense 
molecule capable of hybridising to the nucleic acid 
molecules according to the invention under high 
stringency conditions/ which conditions would be well 
10 known to those skilled in the art. 

Stringency of hybridisation as used herein refers to 
conditions under which polynucleic acids are stable. 
The stability of hybrids is reflected in the melting 
15 temperature (Tm) of the hybrids. Tm can be 
approximated by' the formula: 

81.5°C+16.6(log 10 [Na + ]+0.41 (%G&C) -600/1 

20 wherein 1 is the length of the hybrids in nucleotides. 
Tm decreases approximately by 1-1. 5°C with every 1% 
decrease in sequence homology. 

The term ^stringency" refers to the hybridisation 
25 conditions wherein a single-stranded nucleic acid 

joins with a complementary strand when the purine or 
pyrimidine bases therein pair with their corresponding 
base by hydrogen bonding. High stringency conditions 
favour homologous base pairing whereas low stringency 
30 conditions favour non-homologous base pairing, 

"Low stringency" conditions comprise/ for example, a 
temperature of about 37°C or less, a formamide 
concentration of less than about 50%, and a moderate 
35 to low salt (SSC) concentration; or, alternatively, a 
temperature of about 50°C or less, and a moderate to 
high salt (SSPE) concentration/ for example 1M NaCl. 
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"High stringency" conditions comprise, for example, a 
temperature of about 42°C or less, a formamide 
concentration of less than about 20%, and a low salt 
(SSC) concentration; or, alternatively, a temperature 
5 of about 65°C, or less, and a low salt (SSPE) 
concentration. For example, high stringency 
conditions comprise hybridization in 0.5 M NaHP0 4 , 7% 
sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65°C 
(Ausubel, F.M. et al. Current Protocols in Molecular 
10 Biology , Vol, I, 1989; Green Inc. New York, at 
2.10.3) . 

"SSC" comprises a hybridization and wash solution. A 
stock 20X SSC solution contains 3M sodium chloride, 
15 0.3M sodium citrate, pH 7.0. 

"SSPE" comprises a hybridization and wash solution. A 
IX SSPE solution contains 180 rnM NaCl, 9mM Na 2 HP0 4 and 
1 mM EDTA, pH 7.4. 

20 

The nucleic acid capable of hybridising to nucleic 
acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 
25 nucleotide sequences according to the invention. 

The antisense molecule capable of hybridising to the 
nucleic acid according to the invention may be used as 
a probe or as a medicament or may be included in a 
30 pharmaceutical composition with a pharmaceutically 
acceptable carrier, diluent or excipient therefor. 

The term "homologous" describes the relationship 
between different nucleic acid molecules or amino acid 
35 sequences wherein said sequences or molecules are 

related by partial identity or similarity at one or 
more blocks or regions within said molecules or 
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sequences . 

The present invention also comprises within its scope 
proteins or polypeptides encoded by the nucleic acid 
5 molecules according to the invention or a functional 
equivalent, derivative or bioprecursor thereof. 

Therefore, according to a further aspect of the 
present invention, there is provided a VEGF-X protein, 

10 or a functional equivalent, derivative or bioprecursor 
thereof, comprising an amino acid sequence from 
position 23 to 345 of the sequence as illustrated in 
Figure 10, or alternatively which amino acid sequence 
comprises the complete sequence of Figure 10. A 

15 further aspect of the invention comprises a VEGF-X 
protein, or a functional equivalent, derivative or 
bioprecusor thereof, encoded by a nucleic acid 
molecule according to the invention. Preferably, the 
VEGF-X protein encoded by said nucleic acid molecule 

20 comprises the sequence from position 23 to 345 of the 
amino acid sequence as illustrated in Figure 10, or 
which sequence alternatively comprises the sequence of 
amino acids of Figure 10. 

25 The DNA molecules according to the invention may, 

advantageously, be included in a suitable expression 
vector to express VEGF-X encoded therefrom in a 
suitable host. Incorporation of cloned DNA into a 
suitable expression vector for subsequent 

30 transformation of said cell and subsequent selection 
of the transformed cells is well known to those 
skilled in the art as provided in Sambrook et al. 
(1989) , molecular cloning, a laboratory manual, Cold 
Spring Harbour Laboratory Press. 

35 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
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the invention operably linked to regulatory sequences, 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
5 the components described are in a relationship 

permitting them to function in their intended manner. 
Such vectors may be transformed into a suitable host 
cell to provide for expression of a polypeptide 
according to the invention- Thus, in a further 

10 aspect, the invention provides a. process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 

15 the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 
polypeptides . 

The vectors may be, for example, plasmid, virus or 
20 phage vectors provided with an origin of replication, 
and optionally a promoter for the expression of said 
nucleotide and optionally a regulator of the promoter. 

The vectors may contain one or more selectable 
25 markers, such as, for example, ampicillin resistance. 

Regulatory elements required for expression include 
promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 

30 binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine-Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector may include a heterologous or 

35 homologous promoter for RNA polymerase II, a 

downstream polyadenylation signal, the start codon 
AUG, and a termination codon for detachment of the 
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ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 
well known in the art. 

: 5 Nucleic acid molecules according to the invention may 
be inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 

10 

In accordance with the present invention, a defined 
nucleic acid includes not only the identical nucleic 
acid but also any minor base variations including in 
particular,, substitutions in cases which result in a 

15 synonymous codon (a different codon specifying the 

same amino acid residue) due to the degenerate code in 
conservative amino acid substitutions. The term 
"nucleic acid sequence" also includes the 
complementary sequence to any single stranded sequence 

20 given regarding base variations. 

The present invention also advantageously provides 
nucleic acid sequences of at least approximately 10 
contiguous nucleotides of a nucleic acid according to 

25 the invention and preferably from 10 to 50 nucleotides 
even more preferably, the nucleic acid sequence 
comprise the sequences illustrated in Figure 3. These 
sequences may, advantageously be used as probes or 
primers to initiate replication, or the like. Such 

30 nucleic acid sequences may be produced according to 
techniques well known in the art, such as by 
recombinant or synthetic means. They may also be used 
in diagnostic kits or the like for detecting the 
presence of a nucleic acid according to the invention. 

35 These tests generally comprise contacting the probe 
with the sample under hybridising conditions and 
detecting for the presence of any duplex or triplex 



WO 00/37641 



PCT/US99/30503 



- 9 - 

formation between the probe and any nucleic acid in 
the sample. 

The nucleic acid sequences according to this aspect of 
5 the present invention comprise the sequences of 
nucleotides illustrated in Figures 3 and 5. 

According to the present invention these probes may be 
anchored to a solid support. Preferably, they are 

10 present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesised in situ on the array. (See Lockhart et 
al. f Nature Biotechnology, vol. 14, December 1996 

15 "Expression monitoring by hybridisation to high 

density oligonucleotide arrays". A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations, 

20 The nucleic acid sequences, according to the invention 
may be produced using such recombinant or synthetic 
means, such as for example using PCR cloning 
mechanisms which generally involve making a pair of 
primers, which may be from approximately 10 to 50 

25 nucleotides to a region of the gene which is desired 
to be cloned/ bringing the primers into contact with 
mRNA, cDNA, or genomic DNA from a human cell, 
performing a polymerase chain reaction under 
conditions which brings about amplification of the 

30 desired region, isolating the amplified region or 

fragment and recovering the amplified DNA. Generally, 
such techniques are well known in the art, such as 
described in Sambrook et al. (Molecular Cloning: a 
Laboratory Manual, 198 9) . 

35 

The nucleic acids or oligonucleotides according to the 
invention may carry a revealing label. Suitable 
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labels include radioisotopes such as 32 P or 35 S, enzyme 
labels or other protein labels such as biotin or 
fluorescent markers. Such labels may be added to the 
nucleic acids or oligonucleotides of the invention and 
may be detected using known techniques per se. 

Advantageously, human allelic variants or 
polymorphisms of the DNA molecule according to the 
invention may be identified by, for example, probing 
cDNA or genomic libraries from a range of individuals, 
for example, from different populations. Furthermore, 
nucleic acids and probes according to the invention 
may be used to sequence genomic DNA from patients 
using techniques well known in the art, such as the 
Sanger Dideoxy chain termination method, which may, 
advantageously, ascertain any predisposition of a 
patient to certain disorders associated with a growth 
factor according to the invention. 

The protein according to the invention includes all 
possible amino acid variants encoded by the nucleic 
acid molecule according to the invention including a 
polypeptide encoded by said molecule and having 
conservative amino acid changes. Conservative amino 
acid substitution refers to a replacement of one or 
more amino acids in a protein as identified in Table 
1- Proteins or polypeptides according to the invention 
further include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said proteins or 
polypeptides- In this context, substantial homology 
is regarded as a sequence which has at least 70%, 
preferably 80 or 90% and preferably 95% amino acid 
homology with the proteins or polypeptides encoded by 
the nucleic acid molecules according to the invention. 
The protein according to the invention may be 
recombinant, synthetic or naturally occurring, but is 
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preferably recombinant. 

The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
5 medicament for treating cancer or other diseases or 
conditions associated with expression of VEGF-X 
protein. 

Advantageously, the nucleic acid molecule or the 
10 protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
excipient therefor. 

15 The present invention is further directed to 

inhibiting VEGF-X in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
of antisense DNA or RNA, both of which methods are 

20 based on binding of a polynucleotide to DNA or RNA. 

For example, the 5' coding portion or the mature DNA 
sequence, which encodes for the protein of the present 
invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 

25 A DNA oligonucleotide is designed to be complementary 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al. Nucl. Acids Res., 
6:3073 (1979); Cooney et al., Science, 241:456 (1988); 
and Dervan et ai., Science, 251: 1360 (1991), thereby 

30 preventing transcription and the production of VEGF-X. 
The antisense RNA oligonucleotide hybridises to the 
mRNA in vivo and blocks translation of an mRNA 
molecule into the VEGF-X protein (antisense - Okano, 
J. Neurochem., 56:560 (1991); Oligodeoxynucleotides as 

35 Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 
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Alternatively, the oligonucleotide described above can 
be delivered to cells by procedures in the art such 
that the anti-sense RNA and DNA may be expressed in 
vivo to inhibit production of VEGF-X in the manner 
5 described above . 

Antisense constructs to VEGF-X, therefore, may inhibit 
the angiogenic activity of VEGF-X and prevent the 
further growth of or even regress solid tumours, since 
10 angiogenesis and neovascularization are essential 
steps in solid tumour growth. These antisense 
constructs may also be used to treat rheumatoid 
arthritis, psoriasis and diabetic retinopathy which 
are all characterized by abnormal angiogenesis. 

A further aspect of the invention provides a host cell 
or organism, transformed or transfected with an 
expression vector according to the invention. The 
host cell or organism may advantageously be used in a 
20 method of producing VEGF-X, which comprises recovering 
any expressed VEGF-X from the host or organism 
transformed or transfected with the expression vector. 

According to a further aspect of the invention there 
25 is also provided a transgenic cell, tissue or organism 
comprising a transgene capable of expressing VEGF-X 
protein according to the invention. The term 
"transgene capable of expression" as used herein means 
a suitable nucleic acid sequence which leads to 
30 expression of VEGF-X or proteins having the same 

function and/or activity. The transgene, may include, 
for example, genomic nucleic acid isolated from human 
cells or synthetic nucleic acid, including DNA 
integrated into the genome or in an extrachromosomal 
35 state. Preferably, the transgene comprises the 

nucleic acid sequence encoding the proteins according 
to the invention as described herein, or a functional 
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fragment of said nucleic acid. A functional fragment 
of said nucleic acid should be taken to mean a 
fragment of the gene comprising said nucleic acid 
coding for the proteins according to the invention or 
5 a functional equivalent, derivative or a non- 
functional derivative such as a dominant negative 
mutant, or bioprecursor of said proteins. For 
example, it would be readily apparent to persons 
skilled in the art that nucleotide substitutions or 
10 deletions may be used using routine techniques, which 
do not affect the protein sequence encoded by said 
nucleic acid, or which encode a functional protein 
according to the invention. 

15 VEGF-X protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also forms part of the 
present invention. 

20 Antibodies to the protein or polypeptide of the 

present invention may, advantageously, be prepared by 
techniques which are known in the art- For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse or rabbit, with the 

25 polypeptide according to the invention or an epitope 
thereof and recovering immune serum. Monoclonal 
antibodies may be prepared according to known 
techniques such as described by Kohler R. and Milstein 
C.r Nature (1975) 256, 495-497, Advantageously, such 

30 antibodies may be included in a kit for identifying 

VEGF-X in a sample, together with means for contacting 
the antibody with the sample. 

Advantageously, the antibody according to the 
35 invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other diseases associated with expression of VEGF-X. 
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The invention also further provides a pharmaceutical 
composition comprising said antibody together with a 
pharmaceutically acceptable carrier diluent or 
excipient therefor. 

5 

Proteins which interact with the polypeptide of the 
invention may be identified by investigating protein- 
interactions using the two-hybrid vector system first 
proposed by Chien et al., (1991) Proc. Natl. Acad. 
10 Sci. USA 88 : 9578-9582. 

This technique is based on functional reconstitution 
in vivo of a transcription factor which activates a 
reporter gene. More particularly the technique 

15 comprises providing an appropriate host cell with a 
DNA construct comprising a reporter gene under the 
control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 
domain, expressing in the host cell a first hybrid DMA 

20 sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 

25 a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 

30 investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

35 An example of such a technique utilises the GAL 4 

protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 



WO 00/37641 



- 15 - 



PCT/US99/30503 



domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed/ one of which comprises the nucleotide 
5 residues encoding the DNA binding domain of GAL 4 . 

These binding domain residues may be fused to a known 
protein encoding sequence, such as for example, the 
nucleic acids according to the invention- The other 
vector comprises the residues encoding the protein 

10 binding domain of GAL4. These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 

15 reporter molecule in a GAL -4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
p-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

20 

A further aspect of the present invention also 
provides a method of identifying VEGF-X in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
25 any binding of any proteins to said antibody. A kit 
for identifying the presence of VEGF-X in a sample is 
also provided comprising an antibody according to the 
invention and means for contacting said antibody with 
said sample. 

30 

VEGF-X may be recovered and purified from recombinant 
cell cultures by methods known in the art, including 
ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, 
35 phosphocellulose chromatography, hydrophobic 

interaction chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
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chromatography . 

The VEGF-X protein of the present invention may be a 
naturally purified product/ or a product of chemical 
5 synthetic procedures, or produced by recombinant 

techniques from a prokaryotic or eukaryotic host (for 
example, by bacterial yeast, higher plant, insect and 
mammalian cells in culture) . Depending upon the host 
employed in a recombinant production procedure, the 
10 polypeptides of the present invention may be 

glycosylated with mammalian or other eukaryotic 
carbohydrates or may be non-glycosylated . 

VEGF-X is particularly advantageous as a wound healing 
15 agent, where, for example, it is necessary to re- 

vascularize damaged tissues, or where new capillary 
angiogenesis is important- Accordingly, VEGF-X may be 
used for treatment of various types of wounds such as 
for example, dermal ulcers, including pressure sores,. 
20 venous ulcers, and diabetic ulcers. In addition, it 
can be used in the treatment of full-thickness burns 
and injuries where angiogenesis is desired to prepare 
the burn in injured sites for a skin graft and flap- 
In this case, VEGF-X or the nucleic acid encoding it 
25 may be applied directly to the wound. VEGF-X may be 
used in plastic surgery when reconstruction is 
required following a burn, other trauma, or even for 
cosmetic purposes. 

30 An important application of VEGF-X is to induce the 
growth of damaged bone, periodontium or ligament 
tissue. For example, it may be used in periodontal 
disease where VEGF-X is applied to the roots of the 
diseased teeth, leading to the formation of new bor.e 

35 and cementum with collagen fibre ingrowths. It can be 
used for regenerating supporting tissues of teeth, 
including alveolar bone, cementum and periodontal 
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ligament , that have been damaged by disease and 
trauma . 

Since angiogenesis is important in keeping wounds 
5 clean and non-infected, VEGF-X may be used in 

association with surgery and following the repair of 
cuts. It should be particularly useful in the 
treatment of abdominal wounds where there is a high 
risk of infection. 

10 

VEGF-X can also be used for the promotion of 
endothelialization in vascular graft surgery. In the 
case of vascular grafts using either transplanted or 
synthetic material, VEGF-X may be applied to the 

15 surface of the graft or at the junction to promote the 
growth of the vascular endothelial cells. One 
derivation of this is that VEGF-X can be used to 
repair the damage of myocardial and other occasions 
where coronary bypass surgery is needed by stimulating 

20 the growth of the transplanted tissue • Related to 
this is the use of VEGFX to repair the cardiac 
vascular system after ischemia. 

The protein of the present invention may also be 
25 employed in accordance with the present invention by 
expression of such protein in vivo, which is often 
referred to as v gene therapy". 

Thus, for example, cells such as bone marrow cells may 
30 be engineered with a polynucleotide (DNA or RNA) 

encoding for the protein ex vivo as defined herein, 
the engineered cells are then provided to a patient to 
be treated with the polypeptide. Such methods are 
well-known in the art. For example, cells may be 
35 engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding for the 
protein of the present invention. 



WO 00/37641 



- 18 - 



PCT/US99/30503 



Similarly, cells may be engineered in vivo for 
expression of the protein in vivo, for example, by 
procedures known in the art. 

5 A further aspect of the invention comprises a method 
of treating a disorder mediated by expression of a 
protein according to the invention, by administering 
to a patient an amount of an antisense molecule as 
described herein, in sufficient concentration to 
10 alleviate or reduce the symptoms of said disorder. 

Compounds which inhibit or enhance angiogenesis may be 
identified by providing a host cell or organism 
according to the invention or a transgenic cell, 

15 tissue or organism according to the invention, 

contacting a test compound with said cell, tissue or 
organism and monitoring for the effect of said 
compound compared to a cell tissue or organism which 
has not been contacted with said compound. These 

20 compounds may themselves be used as a medicament or 

included in a pharmaceutical composition for treatment 
of disorders mediated by inappropriate vascularisation 
or angiogenic activity. 

25 The present inventors have also, advantageously, 

identified in the sequence encoding the VEGF-X protein 
a CUB domain, which has heretofore not previously been 
identified in VEGF-type growth factors. The VEGF-X 
protein may therefore exert dual regulatory effects 

30 via interaction with the VEGF tyrosine kinase 

receptors or with neuropilin receptors mediated by the 
CUB domain. Thus, the sequence encoding said CUB 
domain may be included in an expression vector for 
subsequent transformation of a host cell, tissue or 

35 organism. 

VEGF-X or fragments thereof may be able to modulate 
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the effects of pro-angiogenic growth factors such as 
VEGF as indicated in the findings presented in the 
examples below that the N-terminal part of the VEGF-X 
protein, a CUB-like domain, is able to inhibit VEGF- 
5 stimulated proliferation of HUVECs. VEGF-X or 

fragments thereof may therefore be useful in therapy 
of conditions involving inappropriate angiogenesis. 
Inhibition of the angiogenic activity of VEGF has 
been linked with inhibition of tumour growth in 

10 several models eg Kim K. J. et al, Nature 362:841- 
844, (1993) . Additionally, agents able to inhibit 
angiogenesis would be expected to be useful in 
treating other angiogenesis-dependent diseases such a 
retinopathy, osteoarthritis and psoriasis (Folkman, 

15 J w Nature Medicine 1:27-31, (1995)* 

As identified in more detail in the Examples 
described herein the present inventors have 
surprisingly identified that the CUB domain of VEGF-X 

20 is able to inhibit stimulation of proliferation of 

HUVECs induced by either VEGF or bFGF. The CUB domain 
may, therefore, be utilised as a therapuetic agent 
for inhibition of angiogenesis and for treatment of 
condition associated with inappropriate 

25 vascularisation or angiogenesis. 

Therefore according to a further aspect of the 
invention there is provided a method of inhibiting 
angiogenic activity and inappropriate vascularisation 

30 including formation and proliferation of new blood 
vessels, growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
an amount of a polypeptide having an amino acid 

35 sequence from position 40 to 150 of the sequence 

illustrated in Figure 10 or a nucleic acid molecule 
encoding the CUB domain according to the invention in 
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sufficient concentration to reduce or prevent said 
angiogenic activity. 

Furthermore there is also provided a method of 
5 treating or preventing any of cancer, rheumatoid 

arthritis, psoriasis and diabetic retinopathy, said 
method comprising administering to said subject an 
amount of a polypeptide having an amino acid sequence 
from position 40 to 150 of the sequence illustrated 
10 in Figure 10 or a nucleic acid molecule encoding the 
CUB domain according to the invention in sufficient 
concentration to treat or prevent said disorders. 

The CUB domain may also be used to identify compounds 
15 that inhibit or enhance angiogenic activity such as 

inappropriate vascularisation, in a method comprising 
contacting a cell expressing a VEGF receptor and/or a 
neuropilin 1 or 2 type receptor with said compound in 
the presence of a VEGF-X protein according to the 
20 invention and monitoring for the effect of said 

compound or said cell when compared to a cell which 
has not been contacted with said compound. Such 
compounds may then be used as appropriate to prevent 
or inhibit angiogenic activity to treat the disorders 
25 or conditions described herein, or in a 

pharmaceutical composition. An antibody to said CU3 
domain may also be useful in identifying other 
proteins having said sequences. 



35 
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Deposited PlasmidLa 

Date of Deposit Accession No. 

Plasmid VEGFX/pCR2.1 
5 1T0P0 FL 1 March 1999 LMBP 3925 

Plasmid VEGFX/pRSETB BD 

amino acids 1 March 1999 LMBP 3926 

10 G230-G345 

Plasmid VEGFX/pcR. 2. 1 

FL Clone 9 20 October 1999 LiMBP 3977 

15 Plasmid VEGF-X CUB 

PET22b 20 December 1999 

The above plasmids were deposited at the Belgian 
Coordinated Collections of Microorganisms (BCCM) at 
20 Laboratorium Voor Moleculaire Biologie- 

Plasmidencollectie (LMBP) B-9000, Ghent, Belgium, in 
accordance with the provisions of the Budapest Treaty 
of 28 April 1977. 

25 The invention may be more clearly understood with 
reference to the accompanying example, which is 
purely exemplary, with reference to the accompanying 
drawings, wherein : 

30 Figure 1: is a DNA sequence identified in the 

Incyte LifeSeq™ database coding for a 
novel VEGF-X protein. 



35 



Figure 2: 



is an illustration of amino acid 
sequence of the nucleic acid sequence 
of Figure 1. 
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Figure 3: 

5 

Figure 4 : 

10 

Figure 5: 

15 

Figure 6: 
20 Figure 1 : 

25 Figure 8: 
Figure 9: 

30 

Figure 10: 

35 

Figure 11: 
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is an illustration of PCR primer 
sequences utilised to identify the 
VEGF-X protein according to the 
invention* 

is a diagrammatic illustration of the 
spatial relationships in the VEGF-X 
sequence of the clones identified 
using the PCR primer sequences of 
Figure 3. 

is an illustration of the nucleotide 
sequences of the 5' RACE primers used 
to identify the 5' end of the VEGF-X 
open reading frame. 

is an illustration of the sequence 
obtained from the RACE experiment, 

is an illustration of the nucleotide 
sequences obtained from the search of 
LifeSeq™ database using the sequence 
in Figure 6. 

is an illustration of the primers used 
to clone the entire coding sequence of 
VEGF-X. 

is an illustration of the entire 
coding sequence of VEGF-X. 

is an illustration of the predicted 
amino acid sequence of the nucleotide 
sequence of Figure 9. 

is an alignment of the sequence of 
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5 

Figure 13: 

10 
15 

Figure 14: 

20 

Figure 15: 

25 

Figure 16: 



30 

Figure 17: 
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Figure 10 with the sequences of VEGF-A 
to D. 

is an illustration of variant 
sequences of the VEGF-X protein 
according to the invention. 

is an illustration of the 
oligonucleotide primers used for 
E*coli expression of VEGF-X domains 
and for expression of the full length 
sequence of VEGF-X in a 
baculovirus/insect cell expression 
system. 

depicts nucleic acid sequences of 18 
human EST clones obtained from a BLAST 
search of the LifeSeq™ database used 
to identify the full sequence encoding 
VEGF-X. 

depicts the nucleotide sequences of 50 
human EST clones obtained from the 
LifeSeq™ database. 

is an illustration of nucleotide 
sequences utilised as primers to 
identify the nucleotide sequence 
encoding VEGF-X, 

is a nucleotide sequence coding for a 
partial VEGF-X protein according to 
the invention • 

is an illustration of a partial 
nucleotide sequence encoding VEGF-X 
protein according to the invention. 
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Figure 19: 

5 

Figure 20: 

10 
15 

Figure 21: 

20 

25 Figure 22: 
30 



is an illustration of a DNA and 
polypeptide sequence used for 
mammalian cell expression of VEGF-X. 
The predicted VEGF-X signal sequence 
is in lower case letters. The O 
terminal V5 epitope and His6 sequences 
are underlined. 

is an illustration of a DNA and 
polypeptide sequence used for 
baculovirus/insect cell expression of 
VEGF-X, In the polypeptide sequence 
the signal sequence is shown in lower 
case. The N-terminal peptide tag 
added to the predicted mature VEGF-X 
sequence is' underlined. 

is an illustration of a DNA and 
polypeptide sequence used for £. coii 
expression of VEGF-X- The polypeptide 
sequences at the N- and C- termini 
derived from the MBP fusion and His 6 
tag respectively are underlined. 

illustrates the disulphide-linked 
dimerisation of VEGF-X. Protein 
samples were analysed by SDS-PAGE. 
Prior to loading the gel, samples were 
heated to 95°C for 5 minutes in sample 
buffer in the presence (+) or absence 
(-) of reducing agent. (A) samples 
from COS cell expression of a C- 
terminally V5/HisG peptide-tagged 
construct. The left hand panel is 
total conditioned medium, the right 
hand panel is material purified on 
Nickel agarose resin. Reduced monomer 
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10 
15 

Figure 23: 

20 



25 



Figure 24: 

30 



35 

Figure 25: 



- 25 - 

and putative disulphide-linked, non- 
reduced dimer are indicated by arrows. 
There appears to be proteolysis of the 
protein during purification. Gels were 
blotted onto nylon membranes and 
protein detected with an anti V5 
monoclonal antibody. (B) Samples from 
E.coli expression of a maltose-binding 
protein/His6 dual fusion construct. M 
indicates the molecular weight markers 
(Benchmark, Lif eTechnologies) . The 
gel was stained with Coomassie Blue by 
standard procedures. The fusion 
protein has an apparent molecular 
weight of 80kDa- 

illustrates the glycosylation of VEGF- 
X. VEGF-X was purified from the 
culture supernatant of COS cells 
transfected with the pcDNA6/V5-His 
construct. Supernatants were 
harvested 72h post-transf ection and 
purified on nickel resin. Samples 
were then treated with EndoH (+) or 
untreated (-) before SDS-PAGE and 
blotting, as described in the legend 
to Figure 22. 

is an illustration of the DNA and 
polypeptide sequence used for E. coli 
expression of the VEGF-like domain of 
VEGF-X. Polypeptide sequences at the 
N-terminus of the protein derived from 
the vector are underlined. 

shows expression of the VEGF-X VEGF 
domain in E. coli. Lane 1-10^1 broad 
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range marker (New England Biolabs) , 
lane 2-10ul unreduced sample, lane 3- 
lOyil reduced sample. The reduced PDGF 
domain protein (lane 3) has an 
apparent molecular weight of 
approximately 19kDa on SDS-PAGE. 



Figure 26: 



10 



15 



Figure 27: 



20 



illustrates a DNA and polypeptide 
sequence used for E. coli expression 
of the CUB-like domain of VEGF-X- The 
polypeptide sequence at the N-terminus 
derived from the vector-encoded signal 
and the introduced His6 tag are 
underlined. 

shows expression of the VEGF-X CUB 
domain in E. coli. The CUB domain 
protein was purified on Nickel chelate 
resin. The protein migrates at 
approximately 23kDa on SDS-PAGE. 



25 



30 



35 



Figure 28: illustrates the effect of truncated 

VEGF-X (CUB domain) on HUVEC 
proliferation. (A) Human Umbilical 
Vein Endothelial Cells (one-day- 
treatment) . (B) Human Umbilical Vein 
Endothelial Cells (24-hour starving 
followed by one-day- treatment) . (C) 
Effect of VEGF-A 1€5 and VEGF-X CUB 
domain on the proliferation of HUVEC 
(two-day-treatment) . 

Figure 29: depicts the tissue distribution of 

VEGF-X mRNA analysed by Northern 
blotting and RT-PCR in (A) normal 
tissues and (B) tumour tissue and cell 
lines. 
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10 



15 



Figure 30: depicts the partial intron/exon 

structure of the VEGF-X gene. (A) 
Genomic DNA sequences of 2 exons 
determined by sequencing; exon 
sequence is in upper case, intron 
sequence is in lower case. (B) Shows 
the location of splice sites within 
the VEGF-X cDNA sequence. The 
location of mRNA splicing events is 
indicated by vertical lines. The 
cryptic splice donor/acceptor site at 
nt. 998/999 (diagonal lines) gives 
rise to the splice variant forms of 
VEGF-X. No splice site information is 
given for the region shown in italics. 



20 



Figure 31: is a graphic representation of the 

effect of FL-VEGF-X on HuVEC 
proliferation: (24 hour serum 
starvation followed by one day 
treatment) . 



Figure 32: is a graphic representation of the 

combined effect of truncated VEGF-X 
25 (CUB domain) and human recombinant 

VEGF 1€5 on HuVEC proliferation: (24 hour 
serum starvation followed by two day 
treatment) * 



30 Figure 33: 



35 



is a graphic representation of the 
combined effect of the CUB domain and 
human recombinant bFGF on HuVEC 
proliferation: (24 hour serum 
starvation followed by two day 
treatment) . 



Figure 34: 



is a graphic representation of the 



results of a LDH assay for testing 
cytotoxicity of the CUB domain or the 
CUB domain with rhVEGF 16£ . 

Figure 35: is a graphic representation of the 

results obtained from a LDH assay for 
testing cytotoxicity of the CUB domain 
or CUB domain with rh-bFGF. 

A BLAST (Basic Local Alignment Search Tool; Altschul 
et al~, 1990 J. Mol. Biol. 215, 403-410) search was 
performed in the proprietary LifeSeq™ human EST 
database (Incyte Pharmaceuticals/ Inc./ Palo Alto, 
CA, USA) . BLAST produces alignments of both 
nucleotide and amino acid sequences to determine 
sequence similarity. Because of the local nature of 
the alignments, BLAST is especially useful in 
determining exact matches or in identifying 
homologues. While it is useful for matches which do 
not contain gaps, it is inappropriate for performing 
motif-style searching. The. fundamental unit of BLAST 
algorithm output is the High-scoring Segment Pair 
(HSP) . 



Eighteen human EST clones (Figure 14) with high 
similarity to the previously identified VEGF proteins 
were identified and a further fifty EST clones 
(Figure 15) were identified using these sequences as 
query sequences, allowing us to deduce the putative 
sequence for the new VEGF-X protein. The sequences 
obtained were compared to known sequences to 
determine regions of homology and to identify the 
sequence as a novel VEGF-type protein. Using the DNA 
sequence information in the databases we were able to 
prepare suitable primers having the sequences of 
VEGF-X 1-10 illustrated in Figure 3 for use in 
subsequent RACE experiments to obtain the complete 
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DNA sequence for the VEGF-X gene. 
Cloning 

5 A profile was developed based on the VEGF-like domain 
in existing VEGF sequences (VEGF-A, B, C and D) . 
This was used to search the public databases and the 
Incyte LifeSeq™ database. No significant novel 
matching sequences were found in the public 
10 databases. All of the matching sequences found in 

the LifeSeq™ database (-1000) were assembled to give 
a smaller number of sequences (-30) , which included 
the known VEGFs and a potential novel VEGF (figures 
1 and 2) . This sequence was named VEGF-X. 



15 



20 



25 



30 



Oligonucleotides were designed to amplify the VEGF-X 
sequence from cDNA (figure 3). The ESTs'found in 
LifeSeq™ were from a range of tissues, with a slight 
predominance of sequences from ovary, testis, 
placenta and lung (Figure 14 and 15) . Accordingly 
the oligonucleotides were used to amplify cDNA 
derived from lung and placenta. First-round PCR 
products were found at ~200bp larger than the 
expected sizes, while 3 major species appeared after 
a second round of PCR amplification, the smallest of 
which was of the expected size. These fragments were 
cloned and sequenced. The smallest fragment did 
indeed have the sequence originally identified from 
the LifeSeq database, while the others contained 
insertions (figure 4) . 



As. the first round of amplification suggested that 
the major species found in cDNA from ovary and 
placenta was not that originally identified in the 
35 LifeSeq™ database, the focus of effort was switched 
to the presumed major species (it seemed likely that 
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clones 57, 25-27 and 2.1kb clones 1-3 in fig 4 
represented the major mRNA species) . Conceptual . 
translation of the DNA sequences of these cloned PGR 
fragments indicated that the complete open reading 
5 frame was not present in the clones or in the 

sequence from LifeSeq™. While all clones contained 
the same sequence in the region of the translation 
termination codon, indicating that the end of the 
open reading frame had been identified, the 5' end of 

10 the open reading frame had not been cloned. 5' RACE 
experiments were therefore carried out in order to 
find the start of the reading frame. PGR primers 
designed for RACE experiments are shown in figure 5. 
RACE PCR products were sequenced directly. Sequence 

15 could be obtained from the 3' end of these RACE 

products but not from the 5' end; probably because 
the products were not cloned and were therefore 
heterogeneous at the 5' end. This new sequence was 
assembled with the existing cloned sequence to give 

20 the sequence shown in figure 6. Searching the 

LifeSeq™ database with this sequence identifies ESTs 
which extend the sequence a further 140bp in the 5' 
direction and a further 160bp in the 3' direction 
(figure 7) . This longer contig was used to design 

25 oligonucleotide primers to amplify the entire coding 
sequence (these primer sequences are shown in figure 
8). PCR was carried out using primers 5'-l and 
vegfXIO (in order to clone a "full-length" cDNA) , and 
with primers 5'-l and vegfX6 (in order to clone the 

30 full coding region, see figure 3 for sequences of 
vegfXIO and vegfX6) - A number of clones were 
obtained for the shorter fragment, of which clones 4 
and 7 contain no PCR errors (sequence of clones 4 & 7 
in figure 9) . A single clone was obtained for the 

35 longer fragment (clone 9), but this sequence appears 
to contain 2 PCR errors. 
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The predicted polypeptide from these longer contigs 
is shown in figure 10. Amino acids 1-22 are 
predicted to encode a signal sequence (von Heijne/ 
5 1986, Nucleic Acids Res. 14, 4683-4690). Figure 11 

shows an alignment of the protein sequence with VEGFs 
A-D, The region homologous to the other VEGFs is 
located towards the C-terminus of the protein. As 
the VEGF homology domain is expected to belong to the 

10 TGF-beta superfamily of growth factors and to consist 
of a dimer containing both intra- and intermolecular 
disulphide bonds, initial alignments focussed on the 
cysteines. However, mapping of the sequence onto the 
known x-ray structure of the VEGF-A receptor-binding 

15 domain (Muller et al (1997) Proc. Natl. Acad. Sci USA 
94, 7192-7197) suggests that the alignment in figure 
11 is plausible, as the extra 4 cysteine residues 
within the VEGF-homology region of VEGF-X (compared 
to this region of VEGF-A) correspond to residues 

20 which are spatially close in VEGF-A, and may 
therefore be able to form disulphide bonds, 

A search of the PFAM database of protein domains with 
the full-length polypeptide sequence from figure 10 

25 identifies two domain consensus sequences within the 
polypeptide. The more C-terminal domain is a "VEGF" 
domain; (the known VEGFs all contain this domain and 
the structure of this region of VEGF-A is similar to 
that of PDGF) . Additionally towards the N-terminus 

30 of the polypeptide there is a CUB domain (amino acids 
-40-150) . The CUB domain is a 100-110 amino acid 
extracellular domain found in a number of 
developmentally-regulated proteins. When the full- 
length protein is used to search the protein 

35 databases using the BLAST 2 algorithm, the scores for 
matches to CUB domain-containing proteins are more 
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significant than those to the other VEGFs. 
Interestingly, the most significant matches are to 
the CUB domains of Neuropilins, and Neuropilin-1 was 
recently identified as a receptor of one of the VEGF- 
5 A isoforms VEGF-A U5 (Soker et al. (1998) Cell 92, 735- 
745) . 

Assuming that the variant sequences isolated by PCR 
(i.e. the smaller PCR fragments) use the same 

10 translation initiation site as the full-length 

sequence, they would result in production of the 
variant proteins shown in figure 12. It may be 
significant that both of these variant proteins 
retain the CUB domain and delete all or part of the 

15 VEGF-like domain. The production of these variant 
sequences can be explained by the use of a cryptic 
splice donor/acceptor site within the VEGF-X sequence 
(figure 30B, between nt. 993/999): one variant arises 
by splicing out of the region between nt. 729-998, 

20 the other by splicing out of the region between nt, 
999-1187, 

Expression 

25 Full-length expression constructs 

Mammalian cells 

Clone 4 containing the full CDS of VEGF-X (see figure 
9), was used to generate constructs for expression of 
full-length protein. The sequence was amplified by 

30 PCR and cloned into the vector pCDNA6/V5-His so as to 
add a C-terminal V5 epitope tag and His 6 tag. The DNA 
and polypeptide sequence in this vector is shown in 
figure 19. Transient expression in COS cells 
followed by western blotting and detection via an 

35 anti-V5 mAb demonstrates the secretion of a protein 
of -50K into the medium in transfected cells only 
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{figure 22A) . This construct can also be used to 
generate VEGF-X expressing stable CHO cell lines. 

Baculovirus/ Insect-cell expression system 
5 For expression in the baculovirus/insect cell system 
the DNA encoding the predicted mature VEGF-X 
polypeptide sequence was fused to a sequence encoding 
a signal derived from melittin, a secreted insect 
protein. An N-terminal 6His tag was also added to 

10 facilitate purification. The insert was then cloned 
into the baculovirus expression vector pFASTBAC. The 
DMA and polypeptide sequence of this construct is 
shown in figure 20. Infection of Trichoplusia ni Hi5 
cells with this recombinant baculovirus results in 

15 the secretion of a protein of approximately 4 5K into 
the medium (data not shown) . 

E. coli 

The coding region of VEGF-X has been cloned in a 
20 variety of ways for expression as a secreted protein 
in E.coll. A particularly useful expression clone 
carries an N-terminal fusion to the E.coli 
maltose-binding protein (MBP- derived from the 
expression vector pMAL-p2, New England Biolabs) and a 
25 C-terminal fusion to a 6His tag. The DNA and 

polypeptide sequence of this vector is shown in 
figure 21. Sequential purification of cell fractions 
on Ni-NTA resin and amylose resin allows the 
isolation of the expressed protein (see figure 22B) , 

30 

expression of fragments 

VEGF 

The VEGF domain of VEGF-X has been expressed in 
E.coli. Similar domains from VEGF-A {Christinger et 
35 al. (1996) PROTEINS: Structure, Function and Genetics 
26, 353-357), and VEGF-D (Achen et al (1998) Proc. 
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Natl. Acad. Scl USA 95, 548-553) have been shown to 
be capable of binding to the respective receptors. 
Expression of these domains was carried out using the 
bacterium £. coll. Additionally, the full-length 
5 protein. was expressed using the baculovirus/insect 

cell expression system. The oligonucleotide primers 
which have been obtained for these experiments are 
shown in figure. 13. The construct directed 
expression in the bacterial cytoplasm, and as 

10 expected the protein was produced in insoluble form 

in inclusion bodies (the DNA and polypeptide sequence 
used for PDGF domain expression is shown in figure 
24) . Inclusion bodies were washed, solubilized with 
urea and the protein purified under denaturing 

15 conditions, before refolding by dialysis to remove 
the urea. Soluble protein was obtained, but shows 
little evidence of the disulphide bond linked dimers 
seen with material derived from animal cells (figure 
25, compare with figure 22A & B) . It is not clear 

20 therefore whether this protein is correctly folded. 

CUB 

The CUB domain has been expressed as a soluble 
secreted protein in E.coli (figure 26). The protein 
25 was purified by binding to Ni-NTA resin (figure 27) 
and assayed for activity on KUVECs in an in-vitro 
proliferation assay. 

Properties of -the VEGF-X protein 

30 The transient mammalian cell expression system 

described above has been used to generate full-length 
VEGF-X protein, as shown by antibody detection 
following Western blotting (see figure 22A) . 
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Disulphide bond linked dimers 

The other members of the PDGF family of growth 
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cleavage between the CUB and VEGF domains, EndoH 
treatment of the preparation gives a slight mobility 
change for the full-length protein (figure 23) , but 
for the smaller VEGF domain fragment there is a clear 
5 change, indicating that the predicted glycosylation # 
site within the VEGF domain at residue 254 is indeed 
glycosylated. 

Activity of proteins in cell-Based assays 
10 Protein samples were tested for activity in cell 
proliferation, cell migration and in-vitro 
angiogenesis assays. Active samples can also be 
tested in the in vivo matrigel mouse model of 
angiogenesis. 

15 

Full-length VEGF-X protein 

Conditioned medium derived from COS cells transiently 
expressing VEGF-X (see figure 22A) displayed no 
detectable activity in any of the assays. However, 

20 as VEGF-X protein could only be detected in this 
preparation by Western blotting, and not by 
Coomassie-staining of gels, it is clearly present at 
very low levels and this may be the reason for the 
observed lack of activity in the cell proliferation, 

25 migration or In vitro angiogenesis tests. 

VEGF domain 

The VEGF domain protein described above has been 
tested in cell proliferation (on a range of cell 
30 types) , cell migration and in vitro angiogenesis 
assays and has failed to show activity in any of 
these tests- As suggested above, this may be due to 
incorrect folding of this protein. 
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CUB domain 

The CUB domain protein at the highest dose tested 
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(lpg/ml) appears to inhibit proliferation of HUVECs 
in the absence of other stimulation (figure 28A & B) . 
This effect is also seen following stimulation with 
the lowest VEGF-A l6S dose tested (Ing/ml- figure 28C) . 
5 The CUB domain of VEGF-X therefore appears to show 

antiproliferative activity on HUVECs, even in the 
presence of low VEGF-A 165 doses . 

Tissue distribution of xnRNA 

10 VEGF-A mRNA expression has been shown to be 

upregulated in a wide variety of human tumors (lung, 
breast, ovarian, colon, stomach, liver, pancreas, 
kidney, bladder and prostate- Takahashi et al, 1995) . 
Tumor VEGF-A expression has been shown to correlate 

15 with tumor growth rate, microvascular density and 
tumor metastasis (Takahashi et al, 1995) . It was 
thus of interest to examine the mRNA expression 
patterns of VEGF-X. Accordingly, Northern blot 
analysis of mRNA derived from different tissues has 

20 been carried out. The results indicate that although 
the VEGF-X mRNA is expressed at low levels, it is 
present in a wide range of tissues. PCR 
amplification of cDNA from a range of tissue sources 
supports this idea (figure 29A) . The major mRNA 

25 species is approximately 3.1kb in size. There is 

no significant upregulation seen in tumour cell lines 
or in tumour tissues tested (figure 29B) , with the 
possible exception of the cell lines GI-117 (lung 
carcinoma) and SaOS-2 (osteosarcoma) . The results of 

30 these initial tissue distribution studies do not, 
therefore, provide evidence for upregulation of 
VEGF-X in tumour growth, as is seen with VEGF-A. 
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Genomic structure of the VEGF-X gene 

A genomic BAC clone covering the 3' part of the 

VEGF-X locus was isolated by hybridisation screening 
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of nylon filters containing a human BAC library. 
Direct sequencing of this clone using oligonucleotide 
primers based on the VEGF-X cDNA sequence allowed the 
determination of several intron/exoh boundaries 
5 {figure 30). Interestingly, the position of the mRNA 
splice site within the PDGF domain (nt 1187/1188 in 
figure 30B) is conserved with respect to those in the 
VEGF-A and VEGF-D genes (Tischer et al, 1991; 
Rocchigiani et al, 1998) . 

10 

Materials & Methods 

FCR, Cloning, DNA sequence determination and BAC 
screening . 

15 All primers were purchased from Eurogentec, Seraing, 
Belgium- Insert-specific sequencing primers (15- and 16- 
mers) were designed by visual inspection of the DNA 
sequences. DNA was prepared on Qiagen-tip-20 columns or 
on Qiaquick spin columns (Qiagen GmbH, Dusseldorf, 

20 Germany) and recovered from the spin columns in 30^1 
Tris/EDTA-buffer (lOmM TrisHCl pH 7,5, 1 nuM EDTA (sodium 
salt) ) . Sequencing reactions were performed using 
BigDye™ Terminator Cycle Sequencing Ready Reaction kits 
(Perkin Elmer, AB1 Division, Foster City, CA, USA) and 

25 were run on an Applied Biosystems 377 DNA sequencer 
(Perkin Elmer, ABI Division, Foster City, CA, USA) . 
Polymerase chain reactions were carried out according 
to standard procedures (Ausubel et al, 1997). The 
PGR fragments were cloned into vectors pCR2 . 1 

30 (Invitrogen, Carlsbad, CA. USA) or pCR-TOPO 

(Invitrogen,NL) according to the manufacturer's 
instructions. One of those vectors, plasmid 
VEGFX/pCR2.1 1TOPO FL 

was deposited on 1 March 1999 under Accession No. 
35 LMBP 3925. After sequence determination, the inserts 
were cloned into the desired expression vectors (see 
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figures 19, 20, 21, 24 & 26). 

A human genomic BAG library (Genome Systems, Inc., St 
Louis, MI, USA) was screened by hybridisation to 
5 oligonucleotides derived from the VEGF-X cDNA 
sequence, according to the manufacturer's 
instructions, 3AC DNA was prepared using a Qiagen 
plasmid midi kit (Qiagen GmbH, Diisseldorf, Germany ) 
according to the manufacturer's instructions with 

10 some modifications (after clearing of the lysate from 
chromosomal DNA, supernatants from individual 
preparations were pooled on a single column (tip 
100), and after the 70 % EtOH wash, the pellet was 
resuspended overnight at 4°C in 100 ul TE) . 20-mer 

IS sequencing primers were designed based on the known 
cDNA sequence, and sequencing carried out as above. 

5' RACE 

20 In order to extend the cDNA clone in a 5' direction 
RACE reactions were carried out. Since it was known 
that the mRNA is present in placenta and skeletal 
muscle, Marathon-Ready™ placenta and skeletal muscle 
cDNAs were purchased from Clontech (Palo Alto CA. 

25 USA) and used according to the manufacturer's 
instructions. DNA fragments were excised from 
agarose gels, purified using QiaQuick PCR 
purification columns (Qiagen GmbH, Diisseldorf , 
Germany) and sequenced directly, 

30 

VEGF-X protein expression and purification 
DNA fragments encoding the desired protein sequences 
were amplified by PCR and cloned into appropriate 
expression vector systems. 

35 

For mammalian cell expression, the full coding 
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sequence was cloned into the vector pcDNA6/V5-his 
(Invitrogen Leek/ NL, see figure 19 for construct 
sequence) , so as to add a C-terminal peptide tag to 
assist in detection and purification. 

5 

For insect cell expression the sequence of the 
predicted mature polypeptide was initially amplified 
to add an N-terminal 6His peptide and then cloned 
into the pMelBacB vector (Invitrogen, Leek, NL) to 
10 add an insect cell signal sequence* The entire 

insert was then PCR-cloned into the vector pFASTBAC-1 
(LifeTechnologies, Gaithersburg, MA, USA) for 
construction of a baculovirus according to the 
manufacturer' s instructions - 

15 

For E.coli expression, the coding region was PCR 
amplified to add a C-terminal 6His tag and then 
cloned into the vector pMAL-p2 (New England Biolabs, 
Beverly, MA, USA) • The coding sequence of this 
20 construct is shown in figure 21) • The protein was 
purified first on Ni-NTA resin (Qiagen GmbH, 
Dttsseldorf, Germany) and then on amylose resin (New 
England Biolabs, Beverly, MA, USA) , according to the 
manufacturers' instructions - 

25 

DNA sequences encoding the CUB and VEGF domain 
fragments of VEGF-X were PCR amplified and cloned 
into pET22b and pET21a (Novagen, Madison, WI, USA) 
respectively* The CUB domain protein was prepared 

30 either from the periplasm or medium of induced 

cultures by standard methods (Ausubel et al, 1997) . 
The protein was initially purified by precipitation 
with 20% ammonium sulphate. After overnight dialysis 
vs 20mM Tris Hcl pH7.5, lOOmM NaCl to remove ammonium 

35 sulphate, the protein was further purified on Ni-NTA 
resin as described above. The VEGF domain protein 
was expressed in insoluble form, and preparation of 
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inclusion bodies was carried out using standard 
procedures (Ausubel et al 1997) ♦ Inclusion bodies 
were dissolved in 6M guanidine hydrochloride/ 20mM 
Tris Hcl pH8.Q, 200mM NaCl, lmM 2-mercaptoethanol, 
5 and purified on Ni-NTA resin (Qiagen GmbH, 

Diisseldorf, Germany) according to the manufacturer's 
instructions. The protein was refolded by dialysis 
against several .changes of buffer containing 
decreasing concentrations of denaturant . 

10 

Analysis of protein glycosylation was carried out 
using EndoH (Roche Molecular Biochemicals, Brussels, 
BE) according to the manufacturer's instructions. 

15 Cell Proliferation Assay 

Human umbilical vein endothelial cells (HUVECs) 
(Clonetics, San Diego, CA.) were trypsinized with 
0.05% trypsin/0. 53mM EDTA (Gibco, Gaithersburg, MD. ) , 
resuspended in the EGM-2 (Clonetics, San Diego, CA.), 

20 counted, and distributed in a 96-well tissue culture 
plate at 5,000 cells/well. Following cell attachment 
and monolayer formation (16 hours) , cells were 
stimulated with various concentrations of truncated 
VEGF-X (CUB domain or VEGF domain) or dilutions of 

25 culture supernatants of the full-length VEGF-X (COS 7 
or HEK293) in DMEM (Gibco, Gaithersburg, MD.) 
containing 0.5% to 2% FBS (HyClone, Logan/ UT) as 
indicated. For human fetal dermal fibroblasts 
(American Type Culture Collection, Rockville, MD.), 

30 the growth medium was replaced by DMEM containing 
0.1% BSA (Sigma, St. Louise, MO.) with or without 
various concentrations of truncated VEGF-X proteins. 
For HCASMC (Clonetics, San Diego, CA.), the medium 
was replaced by DMEM containing 0.5% FBS. The cells 

35 were treated for a further 24 hr-72 hr. For the 

measurement of proliferation, the culture media were 
replaced with 100 pi of DMEM containing 5% FBS and 3 
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In Vitro Angiogenesis Assay 

In vitro angiogenesis in fibrin gels was quantitated 
using spheroids of human umbilical vein endothelial 
cells (Korff et al., 1998). To generate endothelial 
5 cell spheroids of defined size and cell number, a 

specific number of cells (~ 800 cells per spheroid) 
was suspended in EGM-2 culture medium containing 20% 
methylcellulpse , (Sigma, St, Louis, MO.), seeded into 
nonadherent round-bottom 96-well plates. All 

10 suspended cells in one well contributed to the 
formation of a single endothelial cell spheroid 
within 24 hours. A fibrin gel stock solution was 
prepared freshly prior to use by mixing 3mg/ml 
fibrinogen (Calbiochem, San Diego, CA. ) in Medium 

15 199(Gibco, Gaithersburg, MD. ) . Assays were performed 
in 24-well culture plates. The 1ml fibrinogen stock 
was mixed with 50 HUVEC spheroids and the 
corresponding test substance including rh-VEGF 165 or 
various concentration of VEGF-X. The 

20 spheriod-containing fibrinogen was rapidly 

transferred into 24-well plates. Fifteen microliters 
of thrombin (100 NIH 0/ml stock, Sigma, St. Louis, 
MO.) was added to the gel for the fibrin gel 
formation. The gel formation usually occurred within 

25 30 seconds. After gel formation, lml /well of Medium 
199 supplemented with 20% FBS, lmg/ml s-aminocaproic 
acid (Calbiochem, San Diego, CA.) and antibiotics 
were added. The gel was incubated at 37 °C (5%C0 2 , 95% 
air, 100% humidity). After 3 days, in vitro 

30 angiogenesis was quantitated by measuring the length 
of the three longest capillary sprouts that had grown 
out of each spheroid (100X magnification), analyzing 
at least 10 spheroids per experimental group and 
experiment . 

35 

Matrigel Mouse Assay 
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The matrigel mouse assay is carried out as described 
by Passanti et al (1992). 

Analysis of VEGF-X gene expression by RT-PCR 
5 analysis. 

Oligonucleotide primers VEGF-E2 and VEGF-X14 (figure 
16; figure 5) were used for the specific PCR 
amplification of a 350 bp fragment from VEGF-X. PCR 
amplifications were performed on human multiple 

10 tissue cDNA (MTC™) panels (Clontech human MTC panels 
I and II and human Tumor MTC panel) normalised to the 
mRNA expression levels of six different housekeeping 
genes. In addition, cDNA was made from different 
tumor cell cultures (Caco-2 colorectal 

15 adenocarcinoma; T-84 colorectal carcinoma; MCF-7 
breast adenocarcinoma; T-47D breast ductal gland 
carcinoma; HT1080 bone fibrosarcoma; SaOS-2 
osteosarcoma; SK-N-MC neuroblastoma; HepG2 
hepatoblastoma; JURKAT T-cell leukemia and THP-1 

20 myelomonocytic leukemia) . For the preparation of 

tumor cell cDNA/ cells were homogenised and total RNA 
prepared using the RNeasy Mini kit (Qiagen GmbH/ 
Hilden, Germany) according to manufacturer's 
instructions. 1 pg of total RNA was reverse 

25 transcribed using oligo(dT)15 as a primer and 50 U of 
Expand™ Reverse Transcriptase (Boehringer Mannheim/ 
Mannheim/ Germany) according to the manufacturer's 
instructions. PCR reactions with VEGF-X-specif ic or 
glyceraldehyde-3 -phosphate dehydrogenase 

30 (G3PDH) -specif ic primers were then performed on 1 pi 
of this cDNA. For all cDNAs, PCR reactions with 
VEGF-X specific primers were performed in a total 
volume of 50 containing 5 jal (±1 ng) of cDNA, Ix 
Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP, 

35 250 nM of primers VEGF-E2 and VEGF-X14 and 1 \il of 

Advantage KlenTaq polymerase mix. Samples were heated 
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to 95°C for 30 s and cycling was done for 30 s at 
95°C and 30 s at 68°C for 25, 30 or 35 cycles. 
Control reactions using specific primers that amplify 
a 1 kb fragment of the housekeeping gene G3PDH were 
5 also performed according to the manufacturer's 
instructions • 

Northern blot analysis of VEGF-X . 

Northern blots containing 2 ]iq of poly (A) -rich RbJA 
10 derived from different human tissues (Clontech 

Laboratories; MTN™ blot, MTN™ blot II and Cancer Cell 
Line MTN™ blot) were hybridized according to the 
manufacturers instructions with a a- [ 32 P] -dCTP 
random-priming labelled (Multiprime labelling kit, 
15 Roche Diagnostics) 293 bp specific VEGF-X fragment 
(PinAI-StuI fragment including 92 bp of the 3' end 
coding region and 201 bp of the 3' untranslated 
region of VEGF-X) . The blots were hybridized 
overnight at 68 °C and final washes at high stringency 
20 were at 68 8 C in O.lx SSC/0.1 % SDS. The membranes 
were autoradiographed for 1 to 3 days with 
intensifying screens. 

Full length VEGF-X 

25 The effect of full length VEGF-X on proliferation of 
HuVEC cells was determined by the 3 H-Thymidine 
incorporation assay- HuVEC cells were serum starved 
for 24 hours prior to treatment with the full length 
VEGF-X at the concentration range from 100 pg/ml-10 

30 pg/ml. There was no effect of VEGF-X at 100 pg/ml-10 
ng/ml on endothelial cell proliferation. AZ the 
higher concentrations of FL-VEGF-X (100 ng/ml and 1 
Ug/ml) there was a marked inhibition of endothelial 
cell proliferation. This is probably due to the very 

35 high endotoxin level in the samples. The VEGF-X 
sample was purified in order to decrease the 
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endotoxin level and is currently tested in the cell 
proliferation assay. 

The Summary from Testing the CUB Domain 
5 The effect of CUB domain on inhibition of HuVEC 

prolieration either serum- (2%), rh-VEGF or bFGF- 
stimulated, was assessed by the 3 H-Thymidine 
incorporation assay. Cells were serum starved 
followed by the treatment with the CUB domain and 

10 various growth factors. Results showed that the CUB 
domain inhibited endothelial cell proliferation, 
either serum- (2%), rh-VEGF or bFGF-stimulated in a 
dose dependent manner with maximal inhibition at 10 
pg/ml. There was approximately a 2-fold inhibition 

15 of proliferation {at 10 ng/ml) of cells stimulated 

with VEGF and bFGF and nearly a 5-fold inhibition of 
cells stimulated with serum (2%) * Results with the 
LDH assay showed that there was no cytotoxicity 
associated with the inhibition of cell proliferation 

20 by the CUB domain. 

Therefore, the N-terminus of the polypeptide from 
Figure 10 has been shown to possess a CUB domain. 
When database searches are carried out using the 

25 full-length coding sequence the best matches (i.e. 

for a BLAST search, those with the lowest probability 
score) are found with the CUB domain rather than with 
the VEGF-like domain. The best match from searching 
release 37 of the SWISSPROT database (Feb 1999) is to 

30 the CUB domain of a neuropilin from Xenopus laevis, 
and the matches to the CUB domains of human 
neuropilins 1 and 2 are also more significant than 
matches to the VEGFs. 

35 This similarity is provocative, given the 

identification of neuropilin-1 and -2 as cellular 
receptors for the VEGF- A 165 (Stoker et al. 1998, 
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reviewed in Neufeld et al. 1999) . It is plausible 
therefore that VEGF-X could exert dual regulatory 
effects: via interaction with the tyrosine kinase 
VEGF-receptors mediated by the VEGF-like domain, as 
5 well as via interaction with VEGF isoforms or with 
the neurophilin receptors, mediated by the CUB 
domain. 

To the best of our understanding the latter would be 
10 entirely novel, and searches on the most recent 
release of the Incyte database do not reveal any 
other proteins containing both CUB and VEGF-like 
domains. This arrangement of domains suggests 
possible positive or negative models of regulation: 

15 

Positive- the VEGF-like domain is able to interact 
productively with the tyrosine kinase VEGF receptors 
giving activation, and the CUB domain is able to 
interact productively with the neuropilin receptor 
20 giving activation. 

Negative- the VEGF-like domain does not interact 
productively with the tyrosine kinase VEGF receptors/ 
either preventing receptor dimerisation or blocking 

25 the VEGF binding sites. Further, the CUB domain does 
not interact productively with the neuropilin 
receptors, either preventing receptor activation or 
blocking the VEGF binding sites, or indeed by binding 
to VEGF isoforms and preventing their interaction 

30 with receptors. 
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TABLE 1 



15 



ORIGINAL RESIDUE 


EXEMPLARY 


SUBSTITUTIONS 


ALA 


SER, 


THR 




ARG 


LYS 






ASN 


HIS, 


SER 




ASP 


GLU, 


ASN 




CYS 


SER 






GLN 


ASN, 


HIS 




GLU 


ASP, 


GLU 




GLY 


ALA, 


SER 




HIS 


ASN, 


GLN 




ILE 


LEU, 


VAL, 


THR 


LEU 


ILE, 


VAL 




LYS 


ARG, 


GLN, 


GLU, THR 


MET 


LEU, 


ILE, 


VAL 


PHE 


LEU, 


TYR 




SER 


THR, 


ALA, 


ASN 


THR 


SER, 


ALA 




TRP 


ARG, 


SER 




TYR 


PHE 






VAL 


ILE, 


LEU ALA 


PRO 


ALA 
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Sequence ID No 1 



corresponds to the amino acid 
sequence from position 23 to 345 
of the amino acid sequence 
illustrated in Figure 10. 



10 



15 



Sequence ID No 2 



Sequence ID No 3 



Sequence ID No 4 



is the amino acid sequence 
illustrated in Figure 10. 

corresponds to the sequence from 
position 257 to 1291 of the 
nucleotide sequence illustrated 
in Figure 9. 

corresponds to the polynucleotide 
sequence of VEGFX1 illustrated in 
Figure 3. 



20 



Sequence ID No 5 



corresponds to the polynucleotide 
sequence of VEGFX2 illustrated in 
Figure 3, 



25 



Sequence ID No 6 



corresponds to the polynucleotide 
sequence of VEGFX3 illustrated in 
Figure 3. 



30 



Sequence ID No 7 



corresponds to the polynucleotide 
sequence of VEGFX4 illustrated in 
Figure 3. 



35 



Sequence ID No 8 



Sequence ID No 9 



corresponds to the polynucleotide 

sequence of VEGFX5 illustrated in 
Figure 3. 

corresponds to the polynucleotide 

sequence of VEGFX6 illustrated in 
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Figure 3. 

Sequence ID No 10 corresponds to the polynucleotide 

sequence of VEGFX7 illustrated in 
5 Figure 3. 

Sequence ID No 11 corresponds to the polynucleotide 

sequence of VEGFX8 illustrated in 
Figure 3* 

10 

Sequence ID No 12 corresponds to the polynucleotide 

sequence of VEGFX9 illustrated in 
Figure 3. 

15 Sequence ID No 13 corresponds to the polynucleotide 

sequence of VEGFX10 illustrated 
in Figure 3 . 

Sequence ID No 14 corresponds to the polynucleotide 
20 sequence of VEGFX11 illustrated 

in Figure 4 . 

Sequence ID No 15 corresponds to the polynucleotide 

sequence of VEGFX12 illustrated 
25 in Figure 4 . 

Sequence ID No 16 corresponds to the polynucleotide 

sequence of VEGFX13 illustrated 
in Figure 4 . 

30 

Sequence ID No 17 corresponds to the polynucleotide 

sequence of VEG FX 14 illustrated 
in Figure 4 . 

35 Sequence ID No 18 corresponds to the polynucleotide 

sequence 5'-l in Figure 8* 
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Sequence ID No 19 
Sequence ID No 20 

5 

Sequence ID No 21 

10 

Sequence ID No 22 

15 

Sequence ID No 23 
20 Sequence ID No 24 
Sequence ID No 25 

25 

Sequence ID No 26 

30 

Sequence ID No 27 

35 

Sequence ID No 28 
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corresponds to the polynucleotide 
sequence 5 ' -2 in Figure 8. 

corresponds to the polynucleotide 
sequence of VEGFX6 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX7 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX8 illustrated in 
Figure 13. 

corresponds to the polynucleotide 
sequence of VEGFX9 illustrated in 
Figure 13, 

corresponds to the polynucleotide 
sequence of VEGBAC1 illustrated 
in Figure 13. 

corresponds to the polynucleotide 
sequence of VEGBAC2 illustrated 
in Figure 13. 

corresponds to a polypeptide 
having the amino acid sequence 
from amino acid position 40 to 
150 of the sequence of Figure 10. 

corresponds to a polypeptide 
having the amino acid sequence 
illustrated in Figure 26. 

corresponds to the sequence from 
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5 Sequence ID No 29 



10 Sequence ID No 30 



position 5 to 508 of the 
nucleotide sequence illustrated 
in Figure 26. 

corresponds to the nucleotide 
sequence from position 5 to 508 
of the nucleotide sequence 
illustrated in Figure 26. 

corresponds to the sequence from 
position. 214 to 345 of the 
nucleotide sequence illustrated 
in Figure 10. 
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CIAIMS 

1. A nucleic acid molecule encoding a VEGF-X 
protein or a functional equivalent, derivative or 
5 bioprecursor thereof, said protein comprising any of 
the sequences from position 23 to 345 of the amino 
acid sequence illustrated in Figure 10, or the 
complete sequence as illustrated in Figure 10. 

10 2- A nucleic acid molecule according to claim 1 
wherein said nucleic acid is a DNA molecule. 

3. A nucleic acid molecule according to claim 1 or 
2 wherein said nucleic acid is a cDNA molecule. 

15 

4. A nucleic acid molecule according to claim 3 
comprising the nucleotide sequence from position 257 
to 1291 of the nucleotide sequence illustrated in 
Figure 9, or sequences that hybridise thereto under 

20 high stringency conditions or the complement thereto. 

5. An antisense molecule capable of hybridising to 
a molecule according to any of claims 1 to 4 under 
high stringency conditions. 

25 

6. A nucleic acid molecule according to any of 
claims 1 to 4 which is of mammalian origin. 

7. A nucleic acid molecule according to claim 6 
30 which is of human origin. 

8. An isolated VEGF-X protein, or a functional 
equivalent, derivative or bioprecursor thereof, 
having an amino acid sequence from position 23 to 345 

35 of the amino acid sequence illustrated in Figure 10 
or the complete amino acid sequence of Figure 10. 
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9. A VEGF-X protein, or a functional equivalent; 
derivative or bioprecusor thereof, encoded by a 
nucleic acid molecule as defined in any of claims 1 
to 4. 

5 

10. A protein according to claim 9, which comprises 
the amino acid sequence illustrated in Figure 10. 

11. An expression vector comprising a nucleic acid 
10 molecule according to any of claims 1 to 4 . 

12. An expression vector according to claim 11 
further comprising a nucleotide sequence encoding a 
reporter molecule. 

15 

13. An expression vector comprising an antisense 
molecule according to claim 5. 

14. A nucleic acid molecule according to any of 

20 claims 1 to 4 or an antisense molecule according to 
claim 5 for use as a medicament. 

15. A host cell transformed or transfected with an 
expression vector according to claim 11 or 12. 

25 

16. A host cell transformed or transfected with an 
expression vector according to claim 13. 

17. A transgenic cell, tissue or organism comprising 
30 a transgene capable of expressing a VEGF-X protein 

according to claim 8 or 9. 

18. A transgenic cell, tissue or organism according 
to claim 17, wherein said transgene is included in an 

35 expression vector. 



19. A VEGF-X protein or a functional equivalent, 
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derivative or bioprecursor thereof/ expressed by a 
cell according to claim 15. 

20. A VEGF-X protein, or a functional equivalent, 
5 derivative or bioprecursor thereof, expressed by a 
transgenic cell, tissue or organism according to 
claim 17. 

21- A process for producing a VEGF-X protein 
10 according to any of claims 8 to 10, said process 

comprising transforming a host cell or organism with 
an expression vector according to claim 11, and 
recovering the expressed protein from said host cell 
or organism. 

15 

22. An antibody capable of binding to a protein 
according to any of claims 8 to 10, or an epitope 
thereof. 

20 23. An antibody according to claim 22 for use as a 
medicament. 

24. A pharmaceutical composition comprising an 
antibody according to claim 22 together with a 

25 pharmaceutical^ acceptable carrier diluent or 
excipient thereof. 

25. A method of identifying VEGF-X protein in a 
sample which method comprises contacting said sample 

30 with an antibody according to claim 22 and monitoring 
for binding of any protein to said antibody. 

26. A kit for identifying the presence of VEGF-X 
protein in a sample which comprises an antibody 

35 according to claim 22 and means for contacting said 
antibody with said sample. 
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27. A method of identifying compounds which modulate 
angiogenesis which method comprises providing a host 
cell or organism according to claim 15 or a 
transgenic cell, tissue or organism according to 

5 claim 17, contacting a test compound with said cell/ 
tissue or organism and monitoring for an effect of 
said compound on said VEGF compared to a host cell or 
organism according to claim 15 or a transgenic cell 
tissue or organism according to claim 17 which has 
10 not been contacted with said compound. 

28. A compound identifiable according to the method 
of claim 27. 

15 29, A compound according to claim 28 for use as a 
medicament . 

30. A nucleic acid sequence comprising the 
nucleotide sequences illustrated in any of Figures 3, 

20 5, 8 or 13. 

31. A method for producing a polypeptide, said 
method comprising the steps of: 

25 a) culturing the host cell of claim 15 under 

conditions suitable for expression of the 
polypeptide; and 
b) recovering the polypeptide from the host 
ceil culture. 

30 

32. A method of inhibiting angiogenic activity and 
inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues/ tissue regeneration and organ 

35 and tissue repair in a subject said method comprising 
administering to said subject an amount of an 
antisense molecule according to claim 5 in sufficient 
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concentration to reduce or prevent said angiogenic 
activity. 

33. A method of inhibiting angiogenic activity or 
5 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a subject 
said method comprising administering to said subject 
10 an amount of an antibody according to claim 22 in 
sufficient concentration to reduce or prevent said 
angiogenic activity or inappropriate vascularisation. 

34. A method of inhibiting angiogenic activity or 
15 inappropriate vascularisation including any of 

formation and proliferation of new blood vessels, 
growth and development of tissues, tissue 
regeneration and organ and tissue repair in a 
subject, said method comprising implanting in said 
20 subject cells that express an antibody according to 
claim 22. 

35. A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 

25 retinopathy, said method comprising administering to 
said subject an amount of an antisense molecule 
according to claim 5 in sufficient concentration to 
treat or prevent said disorders. 

30 36. A method of treating or preventing any of 

cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
said subject an amount of an antibody according to 
claim 22 in sufficient concentration to reduce or 

35 prevent said disorders. 

37. A method of promoting angiogenic activity or 
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vascularisation to promote wound healing, skin graft 
growth, tissue repair, proliferation of new blood 
vessels, tissue regeneration and organ repair which 
method comprises applying or delivering to a site of 
interest a therapeutically effective amount of any of 
a group selected from a protein according to claim S 
and a nucleic acid molecule encoding a VEGF-X protein 
or a functional equivalent, derivative or 
bioprecursor thereof comprising an amino acid 
sequence illustrated in Figure 10, an expression 
vector comprising said nucleic acid molecule and a 
pharmaceutical composition comprising any of said 
nucleic acid molecule and said protein. 

38. A method of treating wounds selected from the 
group consisting of dermal ulcers, pressure sores, 
venous sores, diabetic ulcers and burns by applying 
to said wound a therapeutically effective amount of 
any of a VEGF-X protein according to claim 8, a 
pharmaceutical composition comprising said protein 
and a pharmaceutical^ acceptable carrier, diluent or 
excipient therefor . 

39. A nucleic acid molecule encoding a polypeptide 
having a CUB domain said polypeptide comprising the 
amino acid sequence from position 4 0 to 150 of the 
sequence of Figure 10. 

40. A nucleic acid molecule encoding a polypeptide 
having a CUB domain, said polypeptide comprising the 
amino acid sequence of Figure 26. 

41. A nucleic acid molecule according to claim 39 or 
40, comprising the nucleotide sequence from position 
5 to 508 of the sequence illustrated in Figure 26. 



42- A nucleic acid molecule according to any of 
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claims 3 9 to 41 comprising the nucleotide sequence 
illustrated in Figure 26. 

43. A nucleic acid molecule encoding a VEGF like 

5 domain comprising the sequence from position 214-345 
of the sequence of Figure 10 or the sequence from 
position 15 to 461 illustrated in Figure 24. 

44. An expression vector comprising a nucleic acid 
10 molecule according to any of claims 39 to 42. 

45. An expression vector comprising a nucleic acid 
molecule according to claim 43. 

!5 46. A host cell transformed or transfected with an 
expression vector according to claim 44. 

47. A host cell transformed or transfected with an 
expression vector according to claim 45. 

20 

48. A protein expressed by the cell according to 
claim 46. 

49. A protein expressed by the cell according to 
25 claim 47 . 

50. A method of identifying compounds that inhibit 
or enhance angiogenic activity, said method 
comprising contacting a cell expressing a VEGF 

30 receptor and/or a neuropilin 1 or 2 type receptor 
with said compound in the presence of a VEGF-X 
protein according to claim 8 and monitoring for the 
effect of said compound or said cell when compared to 
a cell which has not been contacted with said 

35 compound. 



51. A compound identifiable according to the method 
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of claim 50 as an inhibitor or enhancer of angiogenic 
activity. 

52. A method of inhibiting angiogenic activity or 

5 inappropriate vascularisation, said method comprising 
contacting a cell expressing, a VEGF receptor and a 
neuropilin type receptor with a protein selected from 
any of a protein according to any of claims 6 to 10 
and a protein according to claim 48 or a protein 
10 according to claim 49. 

53. Use of a nucleotide sequence illustrated in any 
of Figures 14 and 15 in identifying a VEGF-X protein 
according to claim 8. 

15 

54. A nucleic acid molecule encoding a polypeptide 
comprising a CUB domain having the sequence from 
position 40 to 150 of the sequence of Figure 10 or 
from position 5 to 508 of the sequence of Figure 26 

20 and a sequence encoding a VEGF domain, 

55. A nucleic acid molecule according to claim 54 
wherein said sequence encoding said VEGF domain is 
selected from the sequences encoding any of VEGF A to 

25 D or isoforms or variants thereof. 

56. A nucleic acid molecule encoding a polypeptide 
comprising the amino acid sequence from position 4 0 
to 150 of the sequence illustrated in Figure 10 for 

30 use as a medicament. 

57. Use of a nucleic acid molecule encoding a 
polypeptide having the amino acid sequence from 
position 40 to 150 of the sequence illustrated in 

35 Figure 10 in the manufacture of a medicament for 
treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour or cancer 
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growth, retinopathy, osteoarthritis or psoriasis. 

58. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 

5 in figure 10 for use as a medicament, 

59. A polypeptide comprising the amino acid sequence 
from position 40 to 150 of the sequence illustrated 
in Figure 10 in the manufacture of a medicament for 

10 the treatment of disease conditions associated with 
inappropriate angiogenesis such as tumour growth, 
retinopathy, osteoarthritis or psoriasis. 

60. Use of a CQB domain comprising the amino acid 
15 sequence from position 40 to 150 of the sequence of 

Figure 10, or the amino acid sequence of Figure 2 6, 
to identify compounds which inhibit angiogenic 
activity in a method according to claim 50. 

20 61. A method of inhibiting angiogenic activity and 

inappropriate vascularisation including formation and 
proliferation of new blood vessels, growth and 
development of tissues, tissue regeneration and organ 
and tissue repair in a subject said method comprising 

25 administering to said subject an amount of a 

polypeptide having an amino acid sequence from 
position 40 to 150 of the sequence illustrated in 
Figure 10 or a nucleic acid molecule according to any 
of claims 39 to 42 in sufficient concentration to 

30 reduce or prevent said angiogenic activity. 

62* A method of treating or preventing any of 
cancer, rheumatoid arthritis, psoriasis and diabetic 
retinopathy, said method comprising administering to 
35 said subject an amount of a polypeptide having an 
amino acid sequence from position 40 to 150 of the 
sequence illustrated in Figure 10 or a nucleic acid 
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molecule according to any of claims 39 to 42 in 
sufficient concentration to treat or prevent said 
disorders. 

5 63. An antisense molecule capable of hybridising to 
a molecule according to any of claims 39 to 42 under 
high stringency conditions. 

64. An antisense molecule capable of hybridising to 
10 a molecule according to claim 43 under high. 

stringency conditions. 

65. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 

15 to claim 48. 

66. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a protein according 
to claim 49. 

20 

67. A transgenic, cell tissue or organism 
according to claim 65 or 66, wherein said transgene 
is included in an expression vector according to 
claim 41 or 42. 

25 

68. An antibody capable of binding to a protein 
according to claim 48 or an epitope thereof- 

69. An antibody capable of binding to a protein 
30 according to claim 4 9 or an epitope thereof. 

70. A pharmaceutical composition comprising an 
antibody according to claim 68 or 69 together with a 
pharmaceutical^ acceptable carrier diluent or 

35 excipient therefor. 

71. A pharmaceutical composition comprising a 
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compound according to claim 4 8 together with a 
pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

72. A nucleic acid molecule encoding a variant of a 
VEGF-x protein having any of the sequences of 
nucleotides illustrated in Figure 12. 
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1 AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC AC T AC C TTG A TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG ATTTC CTTTA GTTTAATCCT 

+1 MetAsn IlePheLeu LeuAsnLeuLeu TiirGluGlu ValArgLeu 

] 

181 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerlleArgGlu GluLeuLys ArgThrAsp 



2 41 TACAGCTGCA CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT 
ATGTCGACGT GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA 

+1 ThrllePheTrp ProGlyCys LeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys 



3 01 ACCATTTTCT GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

+1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 



3 61 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValLeuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuHis LysSerLeu TiirAspVal 



421 GTCCTTCAGT TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTCTGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis HisGluGlu CysAspCys ValCysArgGly SerThrGly Gly 



4 81 GCCCTGGAGC ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG 
CGGGACCTCG TGGTACTCCT CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC 

541 CATCACCACC AGCAGCTCTT GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA 
GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA 
TGCATACGCA ATAGAGGTAG GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT 

661 GGATTTACAG TGCATTCTGA AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC 
CCTAAATGTC ACGTAAGACT TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG 

721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA 
AGAAAACTCT CCTCCGGATT TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT 

7 81 AATGTTGTAT TAAATAGATC ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG 
TTACAACATA ATTTATCTAG TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC 
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841 CTGGGTTCTG TATTTCAGTT CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA 

GACCCAAGAC ATAAAGTCAA GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT 

901 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 

TGACACGTTC ACTCGTGGAC TAAGGCAACG GAACGAATTG AGATTTCGAG GTACAGGACC 

9 61 GCCTAAAATC GTATAAAATC TGGATTTTTT TTTTTTTTTT TGCTCATATT CACATATGTA 

CGGATTTTAG CATATTTTAG ACCTAAAAAA AAAAAAAAAA ACGAGTATAA GTGTATACAT 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG GTTTTTAAAA AGGAACTATG TTGCTATGAA 

TTGGTCTTGT AAGATACATG ATGTTTGGAC CAAAAATTTT TCCTTGATAC AACGATACTT 

1081 TTAAACTTGT GTCGTGCTGA TAGGACAGAG TGGATTTTTC ATATTTCTTA TTAAAATTTC 

AATTTGAACA CAGCACGACT ATCCTGTCTG ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

1141 TGCCATTTAG AAGAAGAGAA CTACATTCAT GGTTTGGAAG AGATAAACCT GAAAAGAAGA 

ACGGTAAATC TTCTTCTCTT GATGTAAGTA CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

12 01 GTGGCCTTAT CTTCACTTTA TCGATAAGTC AGTTTATTTG TTTCATTGTG TACATTTTTA 

CACCGGAATA GAAGTGAAAT AGCTATTCAG TCAAATAAAC AAAGTAACAC ATGTAAAAAT 

12 61 TATTCTCCTT TTGACATTAT AACTGTTGGC TTTTCTAATC TTGTTAAATA TATCTATTTT 

ATAAGAGGAA AAGTGTAATA TTGACAACCG AAAAGATTAG AACAATTTAT ATAGATAAAA 

1321 TACCAAAGGT ATTTAATATT CTTTTTTATG ACAACTTAGA TCAACTATTT TTAGCTTGGT 

ATGGTTTCCA TAAATTATAA GAAAAAATAC TGTTGAATCT AGTTGATAAA AATCGAACCA 

13 81 AAATTTTTCT AAACACAATT GTTATAGCCA GAGGAACAAA GATGATATAA AATATTGTTG 

TTTAAAAAGA TTTGTGTTAA CAATATCGGT CTCCTTGTTT CTACTATATT TTATAACAAC 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC GTATGGTGCT AGAG TTAGAT TAATCTGCAT 

GAGACTGTTT TTATGTACAT AAAGTAAGAG CATACCACGA TCTCAATCTA ATTAGACGTA 

15 01 TTTAAAAAAC TGAATTGGAA TAGAATTGGT AAGTTGCAAA GACTTTTTGA AAATAATTAA 

AAATTTTTTG ACTTAACCTT ATCTTAACCA TTCAACGTTT CTGAAAAACT TTTATTAATT 

1561 ATTATCATAT CTTCCATTCC TGTTATTGGA GATGAAAATA AAAAGCAACT TATGAAAGTA 

TAATAGTATA GAAGGTAAGG AC AATAACC T CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

1621 GACATTCAGA TCCAGCCATT ACTAACCTAT TCCTTTTTTG GGGAAATCTG AGCCTAGCTC 

CTGTAAGTCT AGGTCGGTAA TGATTGGATA AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

1681 AGAAAAACAT AAAGCACCTT G AAAAAG AC T TGGCAGCTTC CTGATAAAGC GTGCTGTGCT 

TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA ACCGTCGAAG GACTATTTCG CACGACACGA 

1741 GTGCAGTAGG AACACATCCT ATTTATTGTG ATGTTGTGGT TTTATTATCT TAAACTCTGT 

CACGTCATCC TTGTGTAGGA TAAATAACAC TACAACACCA AAATAATAGA ATTTGAGACA 

1801 TCCATACACT TGTATAAATA CATGGATATT TTTATGTACA GAAGTATGTC TCTTAACCAG 

AGGTATGTGA AC AT ATTT AT GTACCTATAA AAATACATGT CTTCATACAG AGAATTGGTC 

18 61 TTCACTTATT GTACCTGG 

AAGTGAATAA CATGGACC 
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Predicted VEGF-Iike protein encoded by Incyte contig of 8/12/98 

1 KNIFLLNLLT ESVRLYSCTP RNFSVSIRSE LKRTDTIFW? GCLLVKRCGG 
51 NCACCLHNCN ECQCVPSKVT KXYHEVLQLR PXTGVRGLHK SLTDVALEHH 
101 ESCDCVCRGS TGG 



PCR primers for cloning VEGF-X 



vegfXl 
vegfX2 
vegfX3 
vegfX4 
vegfX5 
vegfXS 
vegfX7 
vegfX8 
vegfX9 
vegfXl 0 



AAAATGTATGGATACAACTTAC 



GTTTGATGAAAGATTTGGGCTTG 



TTTCTAAAGGAAATCAAATTAG 



GATAAGATTTGTATCTGATG 



GATGTCTCCTCTTTCAG 



G C AC AACTCCTAATTCTG 



AGCACCTGATTCCGTTGC 



TAGTACATAGAATGTTCTGG 



AAGAGACATACTTCTGTAC 



CCAGGTACAATAAGTGAACTG 
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^ Variants 



JRF) 



isolated by PCR (at 8/2/99, ali cloned and sequenced at 



a b 
PCR primers- —> 



c d 



e f 



Incyte contig 
(8/12/98) 




clone 22, 29, 41 £ 



clone 52, 59 



clone 15, 20 



clones 57, 25, 
26, 27 



2.1kb clones 1, 
2,3 



primers- a- vegfXI 
(see fig 3) d- vegfXS 



b- vegfX2 
e- vegfX9 



c- vegfX5 
f- vegfX-1 0 
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/^^P. yS. VEGF-X 5* RACE primers 

vegfXll C CTTTAG AAATCTGTTTTCCTG GTAC AG 

vegfXl2 G G AAAATATTC ATC AG ATACAAATCTTATCC 

ve 9fXl3 GGTCCAGTGGCAAAGCTGAAGG 
ve 9 fX14 CTGGTTCAAGATATCGAATAAGGTCTTCC 
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DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCTTC CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACGTCGGAAG GGGACCGCCA CCACTTTCTC 

61 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTGA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+2 HetSerLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGltiArg GlnGlyTh 
] 

121 AATGAGCCTC TTCGGGCTTC TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+ 2 rGlriAlaGlu SerAsnLeuSer SerLysPhe GlnPheSer SerAsnLysGlu GlnAsnGl 



181 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAsp ProGlnHisGlu Argllelle ThrValSer ThrAsnGlySer IleHisSe 



2 41 AGTACAAGAT CCTCAGCATG AGAGAATTAT TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGTTCTA GGAGTCGTAC TCTCTTAATA ATGACACAGA TGATTACCTT CATAAGTGTC 

+2 rProArgPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu ValAlaVa 



3 01 CCCAAGGTTT CCTCATACTT ATC C AAG AAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GGGTTCCAAA GGAGTATGAA TAGGTTCTTT ATGCCAGAAC CATACCTCTA ATCATCGTCA 

+2 IGluGluAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 



3 61 AGAGGAAAAT GTATGGATAC AACTTACGTT TGATGAAAGA TTTGGGCTTG AAGACCCAGA 
TCTCCTTTTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC TTCTGGGTCT 

+2 uAspAspIle CysLysTyrAsp PheValGlu ValGluGlu ProSerAspGly ThrlleLe 



421 AGATGACATA TGCAAGTATG ATTTTGTAGA AGTTGAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACATCT TCAACTCCTT GGGTCACTAC CTTGATATAA 

+2 uGlyArgTrp CysGlySerGly ThrValPro GlyLysGln IleSerLysGly AsnGlnll 



4 81 AGGGCGCTGG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 
TCCCGCGACC AC AC C AAG AC CATGACATGG TCCTTTTGTC TAAAGATTTC CTTTAGTTTA 

+2 eArglleArg PheValSerAsp GluTyrPhe ProSerGlu ProGlyPheCys IleHisTy 



541 TAGGATAAGA TTTGTATCTG ATGAATATTT TCCTTCTGAA CCAGGGTTCT GCATCCACTA 
ATCCTATTCT AAACATAGAC TACTTATAAA AGGAAGACTT GGTCCCAAGA CGTAGGTGAT 

+2 rAsnlleVal MetProGlnPhe ThrGluAla ValSerPro SerValLeuPro ProSerAl 



601 CAACATTGTC ATGCCACAAT TC AC AG AAG C TGTGAGTCCT TCAGTGCTAC CCCCTTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCTTCG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+2 aLeuProLeu AspLeuLeuAsn AsnAlalle ThrAlaPhe SerThrLeuGlu AspLeuIl 



6 61 TTTGCCACTG GACC TGCTTA ATAATGCTAT AACTGCCTTT AGTACCTTGG AAGACCTTAT 
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+ 2 eArgTyrLeu GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 



21 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GATCTATATA GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCTGAATCTT C TAG ATA TAT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 



81 GCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC TAGACTTGGA 

+2 uLeuThrGlu GluValArgLeu TyrSerCys ThrProArg AsnPheSerVal SerlleAr 



8 41 TCTAACAGAG GAGGTAAGAT TATACAGCTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGluGluLeu LysArgThrAsp ThrllePhe TrpProGly CysLeuLeuVal LysArgCy 



9 01 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTG 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGCGAC 

+2 sGlyGlyAsn CysAlaCysCys LeuHisAsn CysAsnGlu CysGlnCysVal ProSerLy 

9 61 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACGGACAA CAGAGGTGTT AACGTTACTT AC AG TT AC AC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 



1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAGACCGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 



sLysSer LeuThrAspVal AlaLeuGlu HisHisGlu GluCysAspCys ValCysAr 



1081 


GCACAAATCA 
CGTGTTTAGT 


CTCACCGACG 
GAGTGGCTGC 


TGGCCCTGGA 
ACCGGGACCT 


GCACCATGAG 
CGTGGTACTC 


GAGTGTGACT 
CTCACACTGA 


GTGTGTGCAG 
CACACACGTC 


+ 2 


gGlySerThr 


GlyGly 










1141 


AGGGAGCACA 
TCCCTCGTGT 


GGAGGATAGC 
CCTCCTATCG 


CGCATCACCA 
GCGTAGTGGT 


CCAGCAGCTC 
GGTCGTCGAG 


TTGCCCAGAG 
AACGGGTCTC 


CTGTGCAGTG 
GACACGTCAC 


1201 


CAGTGGCTGA 
GTCACCGACT 


TTC T ATT AG A 
AAGATAATCT 


GAACGTATGC 
CTTGCATACG 


GTTATCTCCA 
CAATAGAGGT 


TCCTTAATCT 
AGGAATTAGA 


CAGTTGTTTG 
GTCAACAAAC 


1261 


CTTCAAGGAC 
GAAGTTCCTG 


CTTTCATCTT 
GAAAGTAGAA 


CAGGATTTAC 
GTCCTAAATG 


AGTGCATTCT 
TCACGTAAGA 


GAAAGAGGAG 
CTTTCTCCTC 


ACATCAAACA 
TGTAGTTTGT 


1321 


GAATTAGGAG 
CTTAATCCTC 


TTGTGCAACA 
AACACGTTGT 


GCTCTTTTGA 
CGAGAAAACT 


GAGGAGGCCT 
CTCCTCCGGA 


AAAGGACAGG 
TTTCCTGTCC 


AGAAAAGGTC 
TCTTTTCCAG 


1381 


TTCAATCGTG 
AAGTTAGCAC 


GAAAGAAAAT 
CTTTCTTTTA 


TAAATGTTGT 
ATTTACAACA 


ATTAAATAGA 
TAATTTATCT 


TCACCAGCTA 
AGTGGTCGAT 


GTTTCAGAGT 
CAAAGTCTCA 


1441 


TACCATGTAC 
ATGGTACATG 


GTATTCCACT 
CATAAGGTGA 


AGCTGGGTTC 
TCGACCCAAG 


TGTATTTCAG 
ACATAAAGTC 


TTCTTTCGAT 
AAGAAAGCTA 


ACGGCTTAGG 
TGCCGAATCC 


1501 


GTAATGTCAG 


TACAGGAAAA 


AAACTGTGCA 


AGTGAGCACC 


TGATTCCGTT 


GCCTTGCTTA 
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15 61 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA 

TGAGATTTCG AGGTACAGGA CCCGGATTTT 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA 
AAACGAGTAT AAGTGTATAC ATTTGGTCTT 

16 81 AAAGGAACTA TGTTGCTATG AATTAAACTT 

TTTCCTTGAT ACAACGATAC TTAATTTGAA 

1741 TCATATTTCT TATTAAAATT TCTGCCATTT 
AGTATAAAGA ATAATTTTAA AGACGGTAAA 

18 01 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT 
TCTCTATTTG GACTTTTCTT CTCACCGGAA 

18 61 TGTTTCATTG TGTACATTTT TATATTCTCC 

ACAAAGTAAC ACATGTAAAA ATATAAGAGG 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG 
AGAACAATTT ATATAGATAA AAATGGTTTC 

19 81 GATCAACTAT TTTTAGCTTG GTAAATTTTT 

CTAGTTGATA AAAATCGAAC CATTTAAAAA 

2 041 AAGATGATAT AAAATATTGT TGCTCTGACA 
TTCTACTATA TTTTATAACA ACGAGACTGT 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA 
GATCTCAATC TAATTAGACG TAAAATTTTT 

2161 AAGACTTTTT GAAAATAATT AAATT AT C AT 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA 

2 221 TAAAAAGCAA CTTATGAAAG TAGACATTCA 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT 

22 81 TGGGGAAATC TGAGCCTAGC TCAGAAAAAC 

ACCCCTTTAG ACTCGGATCG AGTCTTTTTG 

23 41 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA 

AGGACTATTT CGCACGACAC GACACGTCAT 

2401 GTTTTATTAT CTTAAACTCT GTTCCATACA 
CAAAATAATA GAATTTGAGA CAAGGTATGT 

24 61 CAGAAGTATG TCTCT 

GTCTTCATAC AGAGA 



TCGTATAAAA TCTGGATTTT TTTTTTTTTT 
AGCATATTTT AG AC C T AAAJV AAAAAAAAAA 

CATTCTATGT ACTACAAACC TGGTTTTTAA 
GTAAGATACA TGATGTTTGG ACCAAAAATT 

GTGTCGTGCT GATAGGACAG ACTGGATTTT 
CACAGCACGA CTATCCTGTC TGACCTAAAA 

AGAAGAAGAG AACTACATTC ATGGTTTGGA 
TCTTCTTCTC TTGATGTAAG TACCAAACCT 

ATCTTCACTT TATCGATAAG CCAGTTTATT 
TAGAAGTGAA ATAGC TATTC GGTCAAATAA 

TTTTGACATT ATAACTGTTG GCTTTTCTAA 
AAAACTGTAA TATTGACAAC CGAAAAGATT 

GTATTTAATA TTCTTTTTTA TGACAACTTA 
CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

CTAAACACAA TTGTTATAGC CAGAGGAACA 
GATTTGTGTT AACAATATCG GTCTCCTTGT 

AAAATACATG TATTTCATTC TCGTATGGTG 
TTTTATGTAC ATAAAGTAAG AGCATACCAC 

AC TG AATTGG AATAGAATTG GTAAGTTGCA 
TGACTTAACC TTATCTTAAC CATTCAACGT 

ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TAGAAGGTAA GGACAATAAC CTCTACTTTT 

GATCCAGCCA TTACTAACCT ATTCCTTTTT 
CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

ATAAAGCACC TTGAAAAAGA CTTGGCAGCT 
TATTTCGTGG AACTTTTTCT GAACCGTCGA 

GGAACACATC CTATTTATTG TGATGTTGTG 
CCTTGTGTAG GATAAATAAC AC T AC AAC AC 

CTTGTATAAA TACATGGATA TTTTTATGTA 
GAACATATTT ATGTACCTAT AAAAATACAT 
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New Sequence + Incyte ESTs 



1 ATTTGTTTAA ACCTTGGGAA ACTGGTTCAG GTCCAGGTTT TGCTTTGATC CTTTTCAAAA 
TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

.61 ACTGGAGACA CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTC TGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC C AG AG C AGG C TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GTCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGTGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLeu LeuLeuThrSer AlaLeuAl 

] 

2 41 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AGAGTGGGGT CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+ 2 aGlyGlnArg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 



3 01 C GG C C AG AG A CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTC C AG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

+ 2 rAsnLysGlu GlnTyrGlyVal GlnAspPro GlnHisGlu ArgllelleThr ValSerTh 



3 61 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

+ 2 rAsnGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLeuVa 



421 TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTC CAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

+ 2 lTrpArgLeu ValAlaValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 



4 81 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CATCGTCATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

+ 2 eGlyLeuGlu AspProGluAsp AspIleCys LysTyrAsp PheValGluVal GluGluPr 



5 41 TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGGGTCTTC T AC TGTAT AC GTTCATACTA AAACATCTTC AACTCCTTGG 

+ 2 oSerAspGly ThrlleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnll 



601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+ 2 eSerLysGly AsnGlnlleArg IleArgPhe ValSerAsp GluTyrPhePro SerGluPr 



6 61 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGAC TTGG 
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+ 2 oGlyPheCys IleHisTyrAsn IleValMet ProGlnPhe ThrGluAlaVal SerProSe 



AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT TGTAACAGTA CGGTGTTAAG TGTC TTCG AC ACTCAGGAAG 

rValLeuPro ProSerAlaLeu ProLeuAsp LeuLeuAsn AsnAlalleThr AlaPheSe 



1 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGC TATAA CTGCCTTTAG 
TCACGATGGG GGAAGTCGAA ACGGTGACCT GGACGAATTA TTACGATATT GACGGAAATC 

+ 2 rThrLeuGlu AspLeuIleArg TyrLeuGlu ProGluArg TrpGlnLeuAsp LeuGluAs 



8 41 TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA 
ATGGAACCTT CTGGAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+ 2 pLeuTyrArg ProThrTrpGln LeuLeuGly LysAlaPhe ValPheGlyArg LysSerAr 



9 01 TC TAT ATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+ 2 gValValAsp LeuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 



9 61 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerlleArgGlu GluLeuLys ArgThrAsp ThrllePheTrp ProGlyCy 



1021 CTTCTCAGTG TCCATAAGGG AAG AAC T AAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+ 2 sLeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys LeuHisAsnCys AsnGluCy 



10 81 TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal ThrLysLys TyrHisGlu ValLeuGlnLeu ArgProLy 



41 TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA 
AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC CAGGAAGTCA ACTCTGGTTT 

+2 sThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 



1 GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA 
CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC CGGGACCTCG TGGTACTCCT 

+ 2 uCysAspCys ValCysArgGly SerThrGly Gly 



12 61 GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 

CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

1321 GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 
CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

13 81 CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCTGA 

GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT CCTAAATGTC ACGTAAGACT 
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14 41 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC TCTTTTGAGA GGAGGCCTAA 

TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG AGAAAACTCT CCTCCGGATT 

15 01 AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA AATGTTGTAT TAAATAGATC 

TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT TTACAACATA ATTTATCTAG 

15 61 ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG CTGGGTTCTG TATTTCAGTT 

TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

16 21 CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA ACTGTGCAAG TGAGCACCTG 

GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

1681 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT CCATGTCCTG GGCCTAAAAT CGTATAAAAT 
TAAGGCAACG GAACCGAATT GAGATTTCGA GGTACAGGAC CCGGATTTT A GCATATTTTA 

17 41 CTGGATTTTT TTTTTTTTTT TTGCGCATAT TCACATATGT AAACCAGAAC ATTCTATGTA 

GACCTAAAAA AAAAAAAAAA AACGCGTATA AGTGTATACA TTTGGTCTTG TAAGATACAT 

18 01 CTACAAACCT GGTTTTTAAA AAGGAACTAT GTTGCTATGA ATTAAACTTG TGTCATGCTG 

GATGTTTGGA CCAAAAATTT TTCCTTGATA CAACGATACT TAATTTGAAC ACAGTACGAC 

1861 ATAGGACAGA CTGGATTTTT CATATTTCTT ATTAAAATTT CTGCCATTTA GAAGAAGAGA 
TATCCTGTCT GACCTAAAAA GTATAAAGAA TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

1921 ACTACATTCA TGGTTTGGAA GAGATAAACC TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 
TGATGTAAGT ACCAAACCTT CTCTATTTGG ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

19 81 ATCGATAAGT CAGTTTATTT GTTTCATTGT GTACATTTTT ATATTCTCCT TTTGACATTA 

TAGCTATTCA GTCAAATAAA CAAAGTAACA CATGTAAAAA TATAAGAGGA AAACTGTAAT 

2041 TAACTGTTGG CTTTTCTAAT CTTGTTAAAT ATATCTATTT TTACCAAAGG TATTTAATAT 
ATTGACAACC GAAAAGATTA GAACAATTTA TATAGATAAA AATGGTTTCC ATAAATTATA 

2101 TCTTTTTTAT GACAACTTAG ATCAACTATT TTTAGCTTGG TAAATTTTTC TAAACACAAT 
AGAAAAAATA CTGTTGAATC TAGTTGATAA AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

2161 TGTTATAGCC AGAGGAACAA AGATGATATA AAATATTGTT GCTCTGACAA AAATACATGT 
ACAATATCGG TCTCCTTGTT TCTACTATAT TTTATAACAA CGAGACTGTT TTTATGTACA 

22 21 ATTTCATTCT CGTATGGTGC TAGAGTTAGA TTAATC TGC A TTTTAAAAAA CTGAATTGGA 
TAAAGTAAGA GCATACCACG ATCTCAATCT AATTAGACGT AAAATTTTTT GACTTAACCT 

22 81 ATAGAATTGG TAAGTTGCAA AGACTTTTTG AAAATAATTA AATTATCATA TCTTCCATTC 

TATCTTAACC ATTCAACGTT TCTGAAAAAC TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

23 41 CTGTTATTGG AGATGAAAAT AAAAAGCAAC TTATGAAAGT AGACATTCAG ATCCAGCCAT 

G AC AATAAC C TCTACTTTTA TTTTTCGTTG AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

24 01 TACTAACCTA TTCCTTTTTT GGGGAAATCT GAGCCTAGCT CAGAAAAACA TAAAGCACCT 

ATGATTGGAT AAGGAAAAAA CCCCTTTAGA CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

2461 TGAAAAAGAC TTGGCAGCTT CCTGATAAAG CGTGCTGTGC TGTGCAGTAG GAACACATCC 
ACTTTTTCTG AACCGTCGAA GGACTATTTC GCACGACACG ACACGTCATC CTTGTGTAGG 

2 521 TATTTATTGT GATGTTGTGG TTTTATTATC TTAAACTCTG TTCCATACAC TTGTATAAAT 
ATAAATAACA CTACAACACC AAAATAATAG AATTTGAGAC AAGGTATGTG AACATATTTA 
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2 5 81 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 

TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2 641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 

GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

2 7 01 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 

AATACACCAC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTTGGGGAG AAAATT 
AAAACCCCTC TTTTAA 



Additional oligonucleotides used for amplification of entire 
coding region 

5-1 TTTGTTTAAACCTTGGGAAACTGG 
5-2 GTCCAGGTTTTGCTTTGATCC 
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S~/Cr.*J?. DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 

1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC TTTTCAAAAA 
AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT GGAAACTACC 
GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCTA C CCTAATAC A CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 
GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGGAC 

181 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 
CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+ 2 MetSer LeuPheGly LeuLeuLeu LeuThrSerAla LeuAlaGl 

] 

2 41 CACCGCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGG 
GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GACTGTAGAC GGG AC CGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSerAsn LeuSerSei LysPheGlnPhe SerSerAs 



3 01 CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GGACTCATCA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IlelleThrVal SerThrAs 



3 61 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT TAATAATGAC ACAGATGATT 

+2 nGlySerlle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal LeuValTr 



421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnLeu ThrPheAspGlu ArgPheGl 



4 81 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProGluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 



5 41 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACTTCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC 

+2 rAspGlyThr IleLeuGlyArg TrpCysGly SerGlyThr ValProGlyLys GlnlleSe 



6 01 TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG 

+2 rLysGlyAsn GlnlleArglle ArgPheVal SerAspGlu TyrPheProSer GluProGl 



661 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCAGG 
ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGTCC 
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+2 yPheCysIle HisTyrAsnlle ValMetPro GlnPheThr GluAlaValSer ProSerVa 



721 GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA GTCCTTCAGT 
CAAGACGTAG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAGTCA 

+2 lLeuProPro SerAlaLeuPro LeuAspLeu LeuAsnAsn AlalleThrAla PheSerTh 



7 81 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGGGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

+2 rLeuGluAsp LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAspLe 



841 CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

+2 uTyrArgPro ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SerArgVa 



9 01 ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+ 2 lValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCysThrPro ArgAsnPh 



9 61 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLe 



1021 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

+ 2 uLeuValLys ArgCysGlyGly AsnCysAla CysCysLeu HisAsnCysAsn GluCysGl 



10 81 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+ 2 nCysValPro SerLysValThr LysLysTyr HisGluVal LeuGlnLeuArg ProLysTh 



1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACAGGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+ 2 rGlyValArg GlyLeuHisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



12 01 CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGTCC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGG TACTCCTCAC 

+ 2 sAspCysVal CysArgGlySer ThrGlyGly 



12 61 TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

1321 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 

13 81 AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 

TTAGAGTCAA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAG AC TTTC 

1441 AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 
TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
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Predicted Full-length Polypeptide Sequence 



1 MSLFGLLLLT S A LAGQRQGT QAESNLSSXF QFSSNXEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYPRN TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDI CKVDFVE VEEPSDGTIL GRWCGSGTVP GXQISXGNQI RIRFVSDEYF 

151 PSSPGFCIHY NIVMPQFTEA VSPSVLPPSA LPLDLLNNAI TAFSTLEDLI 

201 RYLEPERWQL DLEDLYRPTW QLLGXAFVFG RXSRWDLNL LTEEVRLYSC 

251 TPRNFSVSIR EELXRTDTIF WPGCLLVKRC GGNCACCLKN CNECQCVPSK 

301 VTXXYHEVLQ LRPXTGVRGL HXSLTDVALE HHEECDCVCR GSTGG 
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s^te ft? Alignment of VEGF-X with Other VEGFs 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



20 



40 



MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 



50 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_ HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



60 



80 



100 



TVSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPE 



100 



VEGF_HUMAN 
PLGF — HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD__HUMAN 
99012 6vegx 



120 



140 



MHLLGFFSVACSLLAAALLPGPREAPAAAA 

MYREWWVNV 

DDICKYDFVEV--EEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDE 



30 
10 
148 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99 012 6vegx 



160 



180 



200 
— MN 
—MP 



AFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKM 
FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE 
YFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLED 



2 
2 

80 
60 
198 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



* 220 * 240 * 

FLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFMD-VYQiaSY 
VMRLFPCFLQLLAGLALPAVPPQQWALSAGNGSSEVEWPFQE-WGgSY 

MSPLLRRLLLAALLQLAPAQAPVSQPDAPGHQRKWSWID-VYTgAT 

YKCQLRKGGWQHNREQANLNSRTEETIKFAAAHYNTEILKS IDNEWR StQ 

dwklwrcrlrlksftsmdsrsashrstrfaatfydietlkvideewqgtq 
liryleperwqldledlyrptwqllgkafvfgrksrwdlnllteev3ly 



51 
51 
46 
130 
110 
248 



vegf_human 
plgf_human 
vegb_human 
vegc_human 
vegd__human 

990126vegx 



260 

CHPIETLVDIFQ 
CRALERLVDWS 



CQPREWVPLTV a LMGTVAKQLV 
CMPREVCIDVGK □ FGVATNTFFK 



CSPRETCVEVAS 



aYPDEIEYIFK 
YPSEVEHMFS 



SCTPRNFSVSIRgELKRTDTIFWgG 



g LGKSTNTFFK 



280 








300 




cc 


ND — EGLE 


5ffip 


96 


cc 


GD--ENLH 


W 


96 


cc 


PD--DGLE 


mp 


91 


cc 


NS--EGLQ 


9n 


175 


cc 


ME--ESLI 


SEn 


155 


cc 


LHNCNECQ 


IF 


298 
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★ 






VEG F — HUMAN 


• TEESNITMQ 


i 


MR 




PLGF_HUMAN 


VETANVTMQ 


L 


LK 


I 


VEGB_HUMAN 


TGQHQVRMQ 


I 


LM 


I 


VEGC__HUMAN 


TSTSYLSKT 


L 


FE 


I 


VEG D_ HUMAN 


TSTSYISKQ 


L L 


FE 


I 


99012 6vegx 


. SKVTKKYHE 


V 


LQ 


U 



320 

KPHQG QHIGEjiSFLQ 

RSGDR PSYVEMTFSQ 

RYPS . — SQLGEWSLE - 

TVPLSQG PKPVTHSF. 

SVPLTSV PELVPMKVAISJ 

RPKTGVRGLHKSLTDfflALE: 



340 * 

JeHrPKXDRARQEK : 141 

JRPLREKMKPER : 141 

JRPKKKDSAVKP : 13 5 

JMSKLDVYRQVH : 22 2 

jKSLPTAPRHPYSI : 2 02 

SdHvCRGSTGG : 345 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB__HUMAN 
VEGC_HUMAN 
VEG D__HUMAN 
99 012 6vegx 



360 * 380 * 400 
KSVRGKGKGQKRKRKKSRYKSWSVP 

DSPR 

SIIRRSLPATLPQCQAANKTCPT^miWNNHICRCLAQEDFMFSSDAGDDS 
I RRS I Q I PEEDRC SHSKKLCP I DMLWDSNKCKC VLQEENPLAGT 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



* 420 * 440 * 

TDGFHDICGPNKELDEETCQCVCRAGLRPASCGPKKELDRNSCQCVCKNK 
EDHSHLQEPALCGP 



322 
260 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEG C_HUMAN 
VEGD„HUMAN 
99 0126vegx 



460 * 480 * 500 

CGPCSERRKHLFVQDPQTCKC-SCKNTDSRCKARQLELNER : 20 6 

CGDAVPRR : 14 9 

PLCPRCTQHHQRPDPRTCRCRCRRRSFLRCQGRGLELNPD : 179 

LFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLK : 372 

HMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPD : 310 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



520 



540 



TCRCDKPRR- 



TCRCRKLRR 

GKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS 

TCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 



215 

188 
419 
354 
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Su?. Variant Polypeptide Sequences 



FL_seq 

clone41 

clone20 



20 



40 



SLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
SLFGLIiLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
SLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 



50 
50 
50 



FL_seq 

clone41 

clone20 



60 



80 



100 



TVSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPE 
TVSTNG S I HS PRFPHT YPRNTVLVWRLVAVEENVWI QLTFDERFGLEDPE 
TVSTINf G S I H S PRFPHTYPRJSfTVLVWRLVAVEEIJVWI QLTFDERFGLEDP~ 



100 
100 
100 



FL__seq 

clone41 

clone20 



120 



140 



DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKOISKGNOIRIRFVSDEYF 



150 
150 
150 



FL_seq 
clone41 
clone2 0 



PSEP 
PSEP 1 
PSEP 



160 * 180 * 200 

GFCIHYNg^PQFTEAVSPSVLPPSALPLDLLlSnsrAITAFSTLEDLI : 200 

SNRGGKIMQWHTS : 167 

GFCIHYNyV^PQFTEAVSPSVXjPPSALPLDLLKNAITAFSTLEDLI : 20 0 



FL__seq 
clone41 
clone2 0 



* 220 * 240 * 

RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLYSC 



RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDIiNLLTE- 



250 
243 



FL__seq 
clone41 
clone2 0 



260 * 280 * 300 

T PRNF S VS I REELKRTDT I FWPGC L L VKRCGGNC AC C LHNCNE C QC VP SK 



300 



FL_seq 
clone41 
clone2 0 



* 320 * 340 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 

EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



345 
282 
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Primers for Expression of VEGF-X 

E.coli expression of domain- 

vegx-6 AATTGGATCCGAGAGTGGTGGATCTGAACC 

vegx-7 AATTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTGAATTCATTA I I I I I I AGTAACTTTGCTTGGGACAC 

vegX-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC 

Bacuiovirus/insect celt expression of full-length protein - 
vegbad 

AATTGGATCCGGAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTC 
C 

vegbac2 AATTGAATTCGCTATCCTCCTGTGCTCCCTCTGC 
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>3 9 9 3 1 8 0H1 LUNGNON0 3 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOTO 1 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCT^ 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

> 2 5 5 9 8 7 0H1 ADRETUTO 1 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3 979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTG 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 

GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 

AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNTO 1 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>1302516H1 PLACNOTO 2 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATA.C 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 
ACTAAAAAATACCACGAGGTCC 

>3 684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>4713188H1 BRAIHCTO 1 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 

ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 

CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 
TTGCT 

>458823H1 KERANOTO 1 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 
GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTG 
CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAJVATGTTGTATTA--. 
ATAGATC 

> 1 3 0 3 9 0 9H1 PLACNOTO 2 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
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>273 9211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>37335 65H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATOGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 

CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGN 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT01 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>4163378H1 BRSTNOT32 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 
GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 
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>2054 67 5H1 BEPINOT01 INCYTE 

AAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATT 

TCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACT^ 

TATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAA 

TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

>3993180H1 LUNGNON0 3 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACC^T 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>4164633H1 BRSTNOT32 INCYTE 

CTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTANTTATGACAACTTAGATCAACTATTTTTAGCTTG 

GTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATG 

TATTTCATTCTCGTATGGTGCTAGAGTTAGATTAJ^TCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCA 

AAGACTTTTTGANAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGGGGAGAAAAT 

>2559870H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 
TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTC^ 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3817470H1 BONSTUT 0 1 INCYTE 

TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAA 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTC 

ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTC 

TAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTT 

>3 97 97 67H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAG AGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTG 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACG T ATGC 
GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCC ACAATT 

GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 

AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT 0 1 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>862169H1 BRAITUT03 INCYTE 

AGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCA 

TTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTC 

CTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTT 
GGGGAAATCTGAGCCTAGC 

>4201385H1 BRAITUT29 INCYTE 

TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTAT 
TAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATCT 
TCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



23 / 54 



PCT/US99/30503 



CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 
>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 
ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>254972 0H1 LUNGTUT06 INCYTE 

TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAA 
AATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGT 
AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACT 
TATGANAGTAG 

>877279H1 LUNGAST01 INCYTE 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCAT 
TTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGGCTTTTTGAAAATAATTAAATTATCATATCTTCCATTCC 
TGTTATTGGNGG 

>4713188H1 BRAIHCTO 1 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 
TTGCT 

>2171082H1 ENDCNOT03 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTA 
TATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 
CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGA 

>875 860H1 LUNGAST01 INCYTE 

CTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACC 
TGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCT 
TTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTAT 
GAC 

>706168H1 SYNORAT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGANCTATGTTGCTATGAAT 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCANTTTATCGATAAGTCAGTTTATTTGT 
TTCA 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 
GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTG 
CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAA 
ATAGATC 

>53 843 6H1 LNODNOT02 INCYTE 

AAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCAT 
TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
>13 03909H1 PLACNOT02 INCYTE 

AGGAAATC AAATTAGGATAAGATTTGT ATCTGATGAATATTTTC CTTCTGAACC TTC T AAC AG AGGAGGTAAG ATT ATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCT 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
>273 9211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 
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>2550343H1 LUNGTUT06 INCYTE 

TGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCNAATCTTGTTAAATATATCTATTTTTACCAAAG 
GTATTTAATA.TTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGC 
CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 
>5321148K1 FIBPFEN06 INCYTE 

CACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGNCAAAAATACATGTATTTCATTCTCGTA 
TGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAA 
TAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAAATTCAGATCCAC 
CATTACTAAC 

>879495H1 THYRNOT02 INCYTE 

ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAA 
AGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGT 
AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCT 
TGAAAAA 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>543890H1 OVARNOT02 INCYTE 

TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 
TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAyAAACTGAATTGGNATAGAATTGGTAAGTTGCAAAGNCTT 
TTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGGATGGAAAATAAAAAGCAACTTATGGAAAGTAGG 
ACATTCAGATC 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>464193 9H1 PROSTMT03 INCYTE 

GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 
TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGNAGAGATAAACCTGAAAA 
GAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATGTGTACATTTTTATATTCTCCTTTGACAT 
ATAACGTGGCTTT 

>2007780H1 TESTNOT03 INCYTE 

TTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAAT 
ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAAC 
AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 
>3085331H1 HEAONOT03 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATTTGCTATGAATT 
AAACTTGTGTCGTGCTGATAGGACAGACTGGNTTTTTCATATTTCTTATTANAATTTCTGCCATTAGAAGAAGAGAACTA 
CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTCACTTTATCGATAAGTCAGT 
>3414043H1 PTHYNOT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAAT 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAA 
>3705 963H1 PENCNOT07 INCYTE 

ANACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAA 

TCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTA 

AAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

TTCTGCCATTAGAAGAAGAGAACTACNTTCANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 

>5137 051H1 OVARDIT04 INCYTE 

AAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTNTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGT 

TATTGGAGATGAANATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGG 

AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTNTGTCA 

GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 

>3 55422 3H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 
ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 
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CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGN 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT01 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 

GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>195 564 6H1 CONNNOT01 INCYTE 

TGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGC 
AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAA 

ACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTATATCCTAAACC 

>4163378H1 BRSTNOT3 2 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 

>5095141H1 EFIMNON05 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATNTTCACTTTATCGATAAGTCAGNTTATTTGTTTCATTGTGTACATTTNNA 

TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCTAANCNTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 
CTTT 

>943826H1 ADRENOTO 3 INCYTE 

TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAA 

AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 

>3451273H1 UTRSNONO 3 INCYTE 

TTTTTTNTTTTGCTCATATTCACATATGTAAACCNGAACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAACTATG 

TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCANATTTCTTANTAANNTTTCTGCCATTTAG 
AAGA 

>1402278H1 LATRTUTO 2 INCYTE 

GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 

ATCGTATAAAATCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAA 

CCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTT 
CTTA 

>43 6I191H1 SKIRNOT01 INCYTE 

GCAAAGACTTTTTGANAATNATTAANTTATCATATCTTCCATTCCTGTTATNGGAGATGANAATAAAAAGCAACTTATGA 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 

ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCNATTTATTGTGNTGTN 
GNGGTTTTATGATC 

>1307017H1 PLACNOTO 2 INCYTE 

TGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGC 

CTAAAATCGTATAAAATCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATGTAAACCAGAACATTCTATGTACT 

ACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCA 
TAT 

>5032225H1 HEARFETO 3 INCYTE 

AATTATCATATCTTCCATTCCTGTTATTGGAGATGNAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCAT 

TACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 

CTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCGTT 
CCAT 

>3732621H1 SMCCNOSO 1 INCYTE 

ANAGATGATATAAAANATTGTTGCTCTGACAANNATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 

CNTTTTAAAAAACTGANTTGGAATAGANTTGGTAAGTTGCAAAGNCNTTTGAAAATNATTAAGTTATCAGAT 

> 3 5 3 02 7 4H1 BLADNOTO 9 INCYTE 

TTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATT 

CCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCG 

TGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCCATACACTTG 

TATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCA 

>3 5 3 02 4 9H1 BLADNOTO 9 INCYTE 

CTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATCCAGCCATTACTAACC^T 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGC 
GTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACT 
TGTATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACCTGG 
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VEGFE1 AAAATGTATGGATACAACTTAC 22 

VEGFE2 GTTTGATGAAAGATTTGGGCTTG 23 

VEGFE3 TTTCTAAAGGAAATCAAATTAG 22 

VEGFE4 GATAAGATTTGTATCTGATG 20 

VEGFE5 GATGTCTC CTCTTTC AG 17 

VEGFE6 GCACAACTCCTAATTCTG 18 

VEGFE7 AGCACCTGATTCCGTTGC 19 

VEGFE8 TAGTACATAGAATGTTCTGG 20 

VEGFE9 AAGAGACATACTTCTGTAC 19 

VEGFE10 CCAGGTACAATAAGTGAACTG 21 
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+ 3 M N I F L L 

NLLT EEV RLY 

] 



I AGGAAATCAA ATT AG G AT AA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACT AC TT AT A AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TT CTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDT IFWP GCL 

j 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 

-2 < 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTC CCA AGCAAAGTTA 

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV LQLR PKT GVR 
GLHK SLT D V A 



+ 1 V S G 

DCT NHSP T W P 

] 



2 41 CTAAAAAATA CCACGAGGTC CTTCAGTTGA G AC CAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 

_2 

[ 

+ 3LEHH EEC DCV CRGS TGG 
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> 

+ 2 VQREHRR 
I A A S PPA ALA 

] 



+ 1 WST M R S V TVC AEG AQED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC AC TG AC AC AC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QS .CA VQW L I L LENV CVI 
SIL NLSC LLQ 



+ 1 P E L C SAV ADSI RER MRY 

LHP 
> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 



+2 GPF IFRI YSA F 
> 

4 81 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTC GAGA 



561 TTTGAGAGGA GGC CTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGGACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 



641 AGCTAGTTTC AGAGTTACCA TGTAC GTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TC GAT AC GGC TTAG GGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 

CAGTCATGTC C TTTTTTTG A CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG 



801 CTAAAATCGT ATAAAATCTG GA 
GATTTTAGCA TATTTTAGAC CT 
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+ 3 M N I F L L 

NLLT EEV RLY 

j 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDT IFWP GCL 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 

-2 < 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GG AC C AATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV LQLR PKT GVR 
GLHK SLT D V A 



+ 1 
T N 



V 



H 



W 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 
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■2 
- [ 



+ 3LEHH EEC DCV CRGS TGG 

> 

+ 2 VQREHRR 
I A A S PPA ALA 

] 



+ 1 WST MRSV TVC AEG AQED 
SRI TT3 SSC 



3 21 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT GGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QSCA VQW L I L LENV CVI 
SIL NLSC LLQ 



+ 1PELC SAV ADSI RER MRY 

LHP 



4 01 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 GPF IFRI Y 3 A F 



481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTC C TCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTAC GTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGC TTAACTC TAAAGCTCCA TGTCCTGGGC 
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CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGAC TAA GGCAACGGAA 
CCGAATTGAG ATTTC GAGGT ACAGGACCCG 

801 CTAAAATCGT ATAAAATCTG GATTTTTTTN TTTTTTTTTG CGCATATTCA 
CATATGTAAA C C AG AAC ATT CTATGTACTA 

G ATTTT AG C A TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT 
GTATACATTT GGTCTTGTAA GATACATGAT 

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 
CGTGCTGATA GGACAGACTG GATTTTTC AT 

GTTTGGAC C A AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA 
GCACGACTAT CCTGTCTGAC CTAAAAAGTA 

-3 < 



9 61 ATTTC TTATT AAAATTTCTG CCATTTAGAA GAAGAGAACT ACATTCATGG 
TTTG GAAG AG ATAAACCTGA AAAGAAGAGT 

TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC 
AAACCTTCTC TATTTGG AC T TTTCTTCTCA 

-3 



10 41 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA 
CATTTTTATA TTCTC CTTTT GACATTATAA 

CCGGAATAGA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAACACAT 
G T AAAAAT AT AAGAGGAAAA CTGTAATATT 
_ 3 f 

1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 
TTAATATTCT TTTTTATGAC AACTTAGATC 

GACAACCGAA AAGATTAGAA CAATTTATAT AGATAAAAAT G GTTTC CAT A " 
AATTATAAGA AAAAAT ACT G TTGAATCTAG 

12 01 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGC C AG A 
GGAACAAAGA TGATATAAAA TATTGTTGCT 

TTGATAAAAA TCGAACCATT TAAAAAGATT TGTGTTAACA ATATCGGTCT 
CCTTGTTTCT ACTATATTTT AT AAC AAC G A 

12 81 CTGACAAAAA TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA 
ATCTGCATTT TAAAAAACTG AATTGGAATA 

GACTGTTTTT ATGTACATAA AG TAAG AGC A TACCACGATC TCAATCTAAT 
TAGAC GTAAA ATTTTTTGAC TTAACC TTAT 

13 61 GAATTG GTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATC AT ATCT 
TCCATTCCTG TTATTGGAGA TGAAAATAAA 

CTTAAC C ATT CAACGTTTCT GAAAAACTTT TATTAATTTA ATAGTATAGA 
AGGTAAGGAC AATAACCTCT AC TTTT ATTT 

1441 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAAC CTATTC 
CTTTTTTGGG GAAATCTGAG CCTAGCTCAG 

TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTCGGTAATG ATTGGATAAG 
GAAAAAACCC CTTTAGACTC GGATCGAGTC 
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1521 AAAAACATAA AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT 
GCTGTGCTGT GCAGTAGGAA CACATCCTAT 

TTTTTGTATT TCGTGGAACT TTTTCTGAAC CGTCGAAGGA CTATTTCGCA 
CGACACGACA CGTCATCCTT GTGTAGGATA 

16 01 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 
TATAAATACA TGGATATTTT TATGTACAGA 

AATAACACTA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC 
ATATTTATGT AC C TATAAAA ATACATGTCT 

1681 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TCATACAGAG AATTGGTCAA GTGAATAACA TGGACC 
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f - DNA and polypeptide sequence used for mammalian cell expression 



m 



s 



1 



f 



9 



1 



1 



1 



1 t 



s 



a 1 a 



g q r 



1 GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA 

+ lqgtq aES NLS SKFQ F S S NKE 
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTT7TCCAG CAACAAGGAA 

+ 1 Q N G V QDP QHE RIIT VST NGS 
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAAT7ATTA CTGTGTCTAC TAATGGAAGT 

+ 1IHSP RFP H T Y PR NT VLV WRL 
181 ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT ATGGAGATTA 

+ 1 V A V E ENV WIQ LTFD ERF GLE 
241 GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT TGGGCTTGAA 

+ 1DPED DIC KYD FVEV £ E P SDG 
3 01 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA 

+ 1TILG RWC GSG TVPG KQI SKG 
361 ACT AT ATT AG GGCGCTGGTG TGGTTCTGGT ACT GT AC CAG GAAAACAGAT TTCTAAAGGA 

+ 1NQIR IRF VSD E V F P SEP GFC 
421 AATCAAATTA GGA7AAGATT TGTATCTGAT GAATATTTTC CTTC7GAACC AGGGTTCTGC 

+ 1IHYN IVM PQF TEAV SPS VLP 
481 ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC AGTGCTACCC 

+ 1PSAL ?LD L L. N NAIT AFS TLE 
541 CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG TACCTTGGAA 

+ 1DLIR YLE PER WOL D LED L Y R 
601 GACCTTATTC GA7ATCTTGA ACCAGAGAGA TGGCAGT7GG ACTTAGAAGA TCTATATAGG 

+ 1PTWQ LLG KAF VFGR K SR VVD 
661 CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG AGTGGTGGAT 

+ 1LNLL TEE VRL YSCT ?RN FSV 
721 CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA CTTCTCAGTG 

+ 1SIRE ELK RTD TIFW PGC LLV 
781 TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG TCTCCTGGTT 

+ 1KRCG GNC ACC LHNC NEC QCV 
841 AAACGCTGTG GTC-GGAACTG TGCCTGTTGT CTCCACAATT GCAA7GAATG TCAATGTGTC 

+ 1PSKV TKK YHE VLQL ?. PK TGV 
901 CCAAGCAAAG TTAC7AAAAA ATACCACGAG GTCCTTCAG7 TGAGACCAAA GACCGGTGTC 

+ 1RGLH K S I* TDV ALZH HEE CDC 
961 AGGGGATTGC ACAAA7CACT CACCGACGTG GCCCTGGAGC ACCA7GAGGA GTGTGACTGT 

+ 1VCRG STG G SP. G?"E GKP IPN 
10 21 GTGTGCAGAG GGAGCACAGG AGGATCTAGA GGGCCC7T7C- AAGG7AAGCC TATCCC7AAC 

+ 1 P L L G LPS T R T G H H H K K H 
1081 CCTCTCCTCG GTCTCGATTC TACGCGTACC GGTCA7CA7C ACCA7CACCA TTGA 
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/^yZP. DNA and polypeptide sequence used for baculovirus/insect cell expression 

1 GAATTCAAAG GCCTGTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

+ 3 mkf Ivn.valv fmv vyi syi 
61 ATATGAAATT CTTAGTCAAC GTTGCCCTTG TTTTTATGGT CGTATACATT TCTTACATCT 

+ 3 y a DP £ S H K H H K H E S NLS SKF 
121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC 

+ 3QF SS N K E QNGV QDP QHE R I I 
181 AGTTTTCCAG CAACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA 

+ 3TVST 51 G S I K S P RFP KTY PRN 
241 CTGTGTCTAC TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA 

+ 3TVLV WRL VAVE ENV WIQ LTF 

3 01 CGGTCTTGGT AT GGAGATT A GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG 

+ 3DERF GLE DPED DIC KYD FVE 
361 ATGAAAGATT TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG 

+ 3 VEEP SDG TILG RWC GSG TVP 
421 TTGAGGAACC CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG 

+ 3GKQI SKG NQIR IRF VSD EYF 

4 81 GAAAACAGAT TTCTAAAGGA AATCAAATTA GGATAAGAT7 TGTATCTGAT GAATATTTTC 

+ 3PSEP GFC IKYN IVM PQF TEA 
541 CTTCTGAACC AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG 

+ 3VSPS V L P PSAL PLD LLN MAI 
601 TGAGTCCTTC AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATG CTATAA 

+ 3TAFS TLE DLIR YLE PER WQL 
6 61 CTGCCTTTAG TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG 

+ 3DLED LYR PTWQ LLG KAF VFG 
721 ACTTAGAAGA TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA 

+ 3RKSR VVD LNLL TEE VRL YSC 
781 GAAAATCCAG AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA 

+ 3 T PRN FSV SIRE ELK RTD TIF 
841 CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT 

+ 3WPGC LLV KR CG GNC ACC LHN 
901 GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT 

+ 3CNEC QCV PSKV TKK THE VLQ 
961 GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT 

+ 3LRPK TGV RGLH.KSL TDV ALE 
1021 TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC 

+ 3HHEE SDC VCRG STG G 
1081 ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCTC TAGA 
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+ 3 OTN SSS NNNN NNK NNN LGI 

1 CGCAGACTAA TTCGAGCTCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG 

+ 3 E G R I 5 E F ESNL SSK FQF SSN 

61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA 

+ 3KEQN GVQ DPQH ERI I T V STN 

121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG 

+ 3GSIH SPR FPHT YPR NTV LVW 

181 GAAGTATTCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTGGTATGGA 

+ 3RLVA VEE NVWI QLT FDE RFG 

241 GATTAGTAGC AGTAGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC 

+ 3LEDP EDD ICKY DFV EVE EPS 

3 01 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG 

+ 3DGTI LGR WCGS G T V PGK Q I S 

361 ATGGAACTAT ATTAGGGCGC TGGTGTGGTT CTGGTACTGT ACCAGGAAAA CAGATTTCTA 

+ 3 K G N Q IRI RFVS DEY FPS EPG 

421 AAGGAAATCA AATTAGGATA AGATTTGTAT CTGATGAATA 7TTTCCTTCT GAACCAGGGT 

+ 3FCIH Y N I VMPQ FTE AVS PSV 

481 TCTGCATCCA CTACAACATT GTCATGCCAC AATTCACAGA AGCTGTGAGT CCTTCAGTGC 

+ 3LPPS ALP LDLL NNA ITA ?ST 

541 TACCCCCTTC AGCTTTGCCA CTGGACCTGC TTAATAATGC TATAACTGCC TTTAGTACCT 

+ 3LEDL IRY LEPE RWQ LDL EDL 

601 TGGAAGACCT TAT TCGAT AT CTTGAACCAG AGAGATGGCA GTTGGACTTA GAAGATCTAT 

+ 3YRPT WQL LGKA FVF GRK SRV" 

661 ATAGGCCAAC TTGGCAACTT CTTGG CAAGG CTTTTGTTTT TGGAAGAAAA TCCAGAGTGG 

+ 3VDLN LLT EEVR LYS CTP RNF 

721 TGGATCTGAA CCTTCTAACA GAGGAGGTAA GATTATACAG CTGCACACCT CGTAACTTCT 

+ 3SVSI REE LKRT DTI FWP GCL 

781 CAGTGT C CAT AAGGGAAGAA CTAAAGAGAA CCGATACCAT TTTCTGGCCA GGTTGTCTCC 

+ 3LVKR CGG NCAC CLH N C N ECQ 

841 TGGTTAAACG CTGTGGTGGG AACTGTGCCT GTTGTCTCCA CAATTGCAAT GAATGTCAAT 

+ 3 C V ? S KVT KKYH EVL QLR PKT 

901 GTGTCCCAAG CAAAGTTACT AAAAAATACC ACGAGGTCCT TCAGTTGAGA CCAAAGACCG 

+ 3 G V ?. G L H K S L T D VAL EHH EEC 

961 GTGTCA3GGG ATTGCACAAA TCACTCACCG ACGTGGCCCT GGAGCACCAT GAGGAGTGTG 

+ 3 D C V C ?. G S T G G K H H ~. K H * 

1021 ACTGTG7GTG CA3AGGGAGC ACAGGAGGAC ATCATCACCA TCACCATTGA TCTAGAGTCG 

1081 ACCTGCAGGC AAGCTT 
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-^^T Disulphide*linked dimerisation of VEGF-X 

(A) Mammalian cell expression 



medium 



purified 



reduced monomer 



kDa 





putative nonreduced dimer 



(B) E.coli expression 
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DNA and polypeptide sequence used for E.coli expression of the PDGF-like domain 

+ 3 MR GSHK HHH HGM ASM 

1 AAGGAGATAT ACATATGCGG GGTTCTCATC ATCATCATCA TCATGGTATG GCTAGCATGA 

+ 3 TGGO OMG RDLY DDD DKD P G R 

61 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGGAAGAA 

+ 3KSRV VDL NLLT EEV RLY SCT 

121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC 

+3 PRNF SVS IREE LKR TDT IFW 

181 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG NCA CCL KNC 

241 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+ 3NECQ CVP SKV T KKY HEV L Q L 

301 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA 

+ 3RPKT GVR GLKK SLT D V A LEH 

361 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+ 3HEEC DCV CRGS TGG 

421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAATGAATT CGAAGCTTGA 

4 81 TCCGGCTGCT AACAAAGCCC 
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y/y^^ J?^- Expression of PDGF domain in E.coli 



1 2 3 
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DNA and polypeptide sequence used iorE.coli expression of the CUB-like domain 



+ 2 MA MDIG INS DPE 5HHH HHH 

1 GGCGATGGCC ATGGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA 

+ 2 ESN u S S K FQF SSN KEQN GVQ 
61 TGAATCCAAC CTGAGTAGTA AATTCCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA 

+2 DPQ H E R I ITV STN GSIH S P R 
121 AGATCCXCAG CATGAGAGAA TTATTACTGT GTCTACTAAT GGAAGTATTC ACAGCCCAAG 

+ 2 FPH TYPR NTV LVW R L V A VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAGTAGAGGA 

+2 NVW IQLT FDE RFG LEDP SDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGATGA 

+ 2 ICK Y D F V EVE EPS DGTI LGR 
301 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAGT GATGGAACTA TATTAGGGCG 

+ 2 WCG SGTV PGK QIS KGNQ IRI 
361 CTGGTG7GGT TCTGGTACTG TACCAGGAAA ACAGATTTCT AAAGGAAATC AAATTAGGAT 

+ 2 RFV SDEY FPS EPG FCIH Y N I 
421 AAGATTTGTA TCTGATGAAT ATTTTCCTTC TGAACCAGGG TTCTGCATCC ACTACAACAT 

+ 2 VMP CFTE AV 
4 31 TGTCATGCCA CAATTCACAG AAGCTGTGTA GTCGAGCTCC GTCGACAAGC TTGCGGCCGC 

541 ACTCGAGCAC 
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Partial intron/eion structure of the VEGF-X gene 
(A) - Genomic DNA sequences of 2 exons determined by sequencing 



tttcttttataccatatagtgc.ggacctgaaccagGGTTCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTG 
AGTCCTTCAGTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTAT 
TCGATATCTTGAACCAGAGAGATKCAGTTGGACTTAGAAGATCTATATATC 
TTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATT^ 

TCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTG 
TTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGg cagg t a t acaa 1 1 ttc 1 1 1 1 1 
ggttcccttcgggtattccatgrcct 

aaagccagtcatagacattcgttgacttttaaaagtggcttactcttattcccttccagGTCCTTCAGTTGAGACCAA?.GACCGGT 

GTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGG 

ATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCA^GGACCTTTC^^ 

GAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAG^ 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAG 
GGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCT 

TGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAA 
AATTTCTGCCATTTAGAAGAAGAGAAGTACATTCATGGTTTGGAAGAGA^ 

TCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTA 
AATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGA7CAACTATTT7TAGCTTGGTAAATTTTTCTAA 
ACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTG 
CTAGAGTTAGATTAATCTGCATTTTAAAAAACTGA^ 

TGATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTA^CCTAT 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTTC 

TGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACA 
TGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTC 

TGCTTGTAAAATGCTTAATATCGTGCCTAGGTTATGTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGT 
GGCTATTTTGGGGAGAAAATTatigcgcgcgcgtgcticaagatttactccttggactccgagaaaatgaaagataaa 
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(B) - Location of splice sites within the cDNA sequence 



1 GAATTCCCCC TTTTGTTTAA ACCTTGGGAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC 

61 TTTTCAAAAA CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGA TTATGT 

121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCQAC CCCAGTGCAG 

181 CCTTCCCCTG GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG 

+ 3 MSLFGLLLLTS 
241 AGTGAGCTCT CACCCCAGTC AG CCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG 

+3 A L A G Q R Q G T Q A ESN L S S X F Q 
301 CCCTGGCCGG CCAGAGACAG GGGACTCAGG CGGAA TCCAA CCTGAGTAGT AAATTCCAGT 

+3 F S S N X E Q N G V Q D P Q HER I I T 
361 TTTCCAGCAA CAAGGAACAG AACGCAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG 

+3 V S T N G S I H S P R F P H T Y P R N T 
421 TGTCTACTAA TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG 

+3 V L V W R L V A V E E N V W I Q L T F D 
481 TCTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG 

+3 E R F G LED P E D D X C K Y D F V E V 
541 AAAGATTTGG GCTTGAAGAC CCAGAAGATG ACATA TGCAA GTATGATTTT GTAGAAGTTG 

+3 E E P S D G T I It G R W C G S G T V P G 
€01 AGGAACCCAG TGATGGAACT A TA TTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA 

+3 K Q I S X G N Q I R I R F V S D E Y F P 
661 AACAGATTTC TAAAGGAAAT CAAA TTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT 

+ 3 S E P |g F C I KYNI VMP .QFT EAV 
721 CTGAACCAGG GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA 

+ 3SPSV L?P SALP L D L L N N AIT 
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG 

+ 3AFST LED LIRY LEP E R W QLD 
841 CCTTTAGTAC CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT 

+ 3LEDL Y R P TWQL JO G K A F V FGR 
901 TAGAAGATCT ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TT7GGAAGAA 

+ 3KSRV VDL N" L L T E A // RLY SCT 
961 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGA^*7 AAGATTATAC AGCTGCACAC 

+ 3PRNF 5VS IREE.LKR TDT IFW 
1021 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG NCA CCL HNC 
10 81 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+ 3NECQ CVP SKVT KKY Kslv ! Q L 
1141 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCACTTGA 
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+ 3RPKT GVR GLKK SLT D V A LEH 

1201 GACCAAAGAC CGGTGTCAGG GCATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+ 3HEEC DCV CRGS TGG 

1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 

1321 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

1381 CTCCATCCTT AArCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG CAACAGCTCT TTTGAGAGGA 

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA 

1561 ATAGATCACC AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATT7TTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT 

1801 CTATGTACTA CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1351 CATGCTGATA GGACAGACTG GATTTTTCAT ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACATTCATGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

19 81 TCACTTTATC GATAAG7CAG TTTATTTGTT TCATTGTG7A CATTTTTATA TTCTCCTTTT 

2041 GACATTATAA CTGTTGGCTT TTCTAATCTT GTTAAATA7A TCTATTTTTA CCAAAGGTAT 

2101 TTAATATTCT TTTTTATGAC AACTTAGA7C AACTATTT7T AGCTTGGTAA A7TTTTCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTGTTGCT CTGACAAAAA 

2221 TACATG7ATT TCAXTCTCGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT 7AAAAAACTG 

22 81 AATTGGAATA GAATTGGTAA GTTGCAAAGA C777T7GAAA A7AATTAAAT 7ATCA7ATCT 

23 41 TCCAT7CCTG TTAT7GGAGA 7GAAAA7AAA AAGCAAC77A TGAAAGTAGA CATTCAGATC 
2401 CAGCCA7TAC TAACCTATTC C7TTTT7GGG GAAATC7GAG CCTAGCTCAG AAAAACATAA 

24 61 AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCG7 GCTGTGCTGT GCAGTAGGAA 
2521 CACATCCTA7 TTAT7GTGA7 GTTGTGGT7T TAT7A7CTTA AACTCTGTTC CA7ACACTTG 
2 5 31 TATAAA7ACA 7GGATA77T7 7ATGTACAGA AGTATG7C7C 7TAACGAGTT CACTTA7TGT 
2641 ACCTGGAAGG GCGAATTCTG CAGATATC 
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A pplicanl's or;i**cn(*s 

fiicrelcrenec " B0192/7011WO 



liilcniaiinnal application No. 

PCT/US99/30503 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule \2bis) 



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description 
on page ^1 iinc 15-16 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet | | 

Name of depositary' institution 

BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS (BCCM)™ 
LABORATORIUM VOOR MOLECULAIRE BIOLOGIE - PLASMIDENCOLLECTIE (LMBP) 

Address of depositary institution (including postal code and country) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Date of deposit 

20 December 1999 (20.12.99) 



Accession Number 
LMBP 3991 



C- ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet | | 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not /or all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave hlank if not applicable) 



The indications listed below will be submitted to the International Bureau later (sf>eci/yfhe general nature of the indications e i» "Acccs 
Number ofDefHtsit") 



For receiving Office use only 



I | This sheet was received with the international application 



Authorized of ficer 



— — Tor International Bureau use only 
This sheet was received by thcjntc/niitioiuil (iyjeau on: 



.19 APRIL 2000 



Authorized officer 



Ellen Moyse 
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Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure 

Receipt in the case of an original deposit issued pursuant to Rule 7.1 by the 
International Depositary Authority BCCM™/LMBP identified at the bottom of next page 

international Form BCCM™/LMBP/BP/4/99-23 



To : Name of the depositor : Janssen Pharmaceutica N.V. 



Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



I. Identification of the microorganism: 

1.1 Identification reference given by the depositor: 



VEGF-X CUB PET22b 



1.2 Accession number given by the International Depositary Authority: 



LMBP 3991 
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II. Scientific description and/or proposed taxonomic designation 
The microorganism identified under I above was accompanied by: 

(mark with a cross the applicable box(es)) 

- a scientific description yes O no ["") 

- a proposed taxonomic designation yes Q no 5*3 

III. Receipt and acceptance 

This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on (date of original deposit) : December 20, 1 999 



IV. International Depositary Authority 



Belgian Coordinated Collections of Microorganisms (BCCM™) 

Laboratortum voor Moleculaire Biologie - Plasmidencoliectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized officials): 




Date : January 12, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 
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Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 

the Purposes of Patent Procedure 

Viability statement issued pursuant to Rule 10.2 by the International Depositary 
Authority BCCM™/LMBP identified on the following page 

International Form BCCM™/LMBP/BP/9/99«23 



To : Party to whom the viability statement is issued: 
Name : Dr Ftlip De Corte 



Address : Janssen Pharmaceutica N.V. 

Turnhoutseweg 30 
B-2340 Beerse 
Belgium 

I. Depositor: 

1.1 Name : Janssen Pharmaceutica N.V. 



K.2 Address : Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



II. Identification of the microorganism: 

fl.1 Accession number given by the International Depositary Authority: 



LMBP 3991 



II. 2 Date of the original deposit (or where a new deposit or a transfer has been 
made, the most recent relevant date) : December 20, 1 999 

III. Viability statement. 

The viability of the microorganism identified under (I above was tested on 

: January 1 1 , 2000 

(Give date. In the cases referred to in Rule 10.2{a)(ii) and {iii), refer to the most recent 
viability test). 

On that date, the said microorganism was: (mark the applicable box with a cross) 

[3 viable 

I I no I nger viable 
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BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™ 
LMBP-COLLECTION 

Page 2 of Form BCCM™/LMBP/BP/9/99-23 Viability statement 



(Fill in if the information has been requested and if the results of the test were 
negative). 



Belgian Coordinated Collections of Microorganisms (BCCM 1M ) 

Laboratorium voor Molecuiaire Biologie - Plasmidencollectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 



Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized official(s): 



IV. 



Conditions under which the viability test has been performed: 



V. 



International Depositary Authority 




Date : January 12, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 



